Renormalization 


JOHN COLLINS 


CAMERIDE BIN CMRP SG cre 
SATO ATICAL PETHS 


CAMBRIDGE MONOGRAPHS 
ON MATHEMATICAL PHYSICS 


General Editors: P. V. Landshoff, D. R. Nelson, D. W. Sciama, S. Weinberg 


RENORMALIZATION 


This title, first published in 1984, has been reissued as an Open Access 
publication on Cambridge Core. 


RENORMALIZATION 


An introduction to renormalization, the renormalization group, and the 
operator-product expansion 


JOHN C.COLLINS 


Illinois Institute of Technology 


CAMBRIDGE 


W UNIVERSITY PRESS 


=| CAMBRIDGE 


UNIVERSITY PRESS 


Shaftesbury Road, Cambridge CB2 8EA, United Kingdom 
One Liberty Plaza, 20th Floor, New York, NY 10006, USA 
477 Williamstown Road, Port Melbourne, VIC 3207, Australia 
314-321, 3rd Floor, Plot 3, Splendor Forum, Jasola District Centre, New Delhi — 110025, India 
103 Penang Road, #05-06/07, Visioncrest Commercial, Singapore 238467 


Cambridge University Press is part of Cambridge University Press & Assessment, 
a department of the University of Cambridge. 


We share the University’s mission to contribute to society through the pursuit of 
education, learning and research at the highest international levels of excellence. 


www.cambridge.org 
Information on this title: www.cambridge.org/978 1009401760 


DOI: 10.1017/978 1009401807 
© Cambridge University Press 1984, 2023 


This work is in copyright. It is subject to statutory exceptions and to the provisions 
of relevant licensing agreements; with the exception of the Creative Commons version the 
link for which is provided below, no reproduction of any part of this work may take 
place without the written permission of Cambridge University Press. 


An online version of this work is published at doi.org/10.1017/9781009401807 under a 
Creative Commons Open Access license CC-BY-NC-ND 4.0 which permits re-use, 
distribution and reproduction in any medium for non-commercial purposes providing appropriate 
credit to the original work is given. You may not distribute derivative works 
without permission. To view a copy of this license, visit 
https://creativecommons.org/licenses/by-nc-nd/4.0 


All versions of this work may contain content reproduced under license from third parties. 
Permission to reproduce this third-party content must be obtained from these third-parties directly. 


When citing this work, please include a reference to the DOI 10.1017/9781009401807 


First published 1984 
First paperback printing 1985 
Reprinted 1987, 1989, 1992, 1995, 1998 
Reissued as OA 2023 


A catalogue record for this publication is available from the British Library. 


ISBN 978-1-009-40176-0 Hardback 
ISBN 978-1-009-40179-1 Paperback 


Cambridge University Press & Assessment has no responsibility for the persistence or accuracy of 
URLs for external or third-party internet websites referred to in this publication 
and does not guarantee that any content on such websites is, or will remain, 
accurate or appropriate. 


Contents 


Introduction 


Quantum field theory 

Scalar field theory 

Functional-integral solution 
Renormalization 

Ultra-violet regulators 

Equations of motion for Green’s functions 
Symmetries 

Ward identities 

Perturbation theory 

Spontaneously broken symmetry 
Fermions 

Gauge theories 

Quantizing gauge theories 

BRS invariance and Slavnov—Taylor identities 
Feynman rules for gauge theories 

Other symmetries of (2.11.7) 

Model field theories 


Basic examples 

One-loop self-energy in ¢* theory 
Wick rotation 

Lattice 

Interpretation of divergence 
Computation 

Higher order 

Degree of divergence 

ġ at d=6 


Why may Z be zero and yet contain divergences? 


Renormalizability and non-renormalizability 
Renormalization group 

Arbitrariness in a renormalized graph 
Renormalization prescriptions 

Dimensional regularization 

Minimal subtraction 


Contents 


Definition 

d=6 

Renormalization group and minimal subtraction 
Massless theories 

Coordinate space 


Dimensional regularization 
Definition and axioms 
Continuation to small d 
Properties 

Formulae for Minkowski space 
Schwinger parameters 
Feynman parameters 

Dirac matrices 


Ys and Ekàuv 


Renormalization 

Divergences and subdivergences 

Two-loop self-energy in (¢°), 

Fig. 5.1.2 

Differentiation with respect to external momenta 

Fig. 5.1.3 

Renormalization of Feynman graphs 
One-particle-irreducible graph with no subdivergences 
General case 

Application of general formulae 

Summary 

Three-loop example 

Forest formula 

Formula 

Proof 

Relation to Z 

Renormalizability 

Renormalizability and non-renormalizability 
Cosmological term 

Degrees of renormalizability 

Non-renormalizability 

Relation of renormalizability to dimension of coupling 
Non-renormalizable theories of physics 

Proof of locality of counterterms; Weinberg’s theorem 
Degree of counterterms equals degree of divergence 
R(G) is finite if 6(G) is negative 

Asymptotic behavior 

Oversubtractions 

Mass-shell renormalization and oversubtraction 


56 
57 
58 
59 
59 


62 
64 
68 
73 
81 
81 
83 
83 
86 
88 
89 
93 
94 
97 
99 
101 
102 
103 
105 
106 
106 
109 
109 
110 
112 
116 
116 
118 
118 
120 
12] 
122 
125 
126 
127 
129 
130 
130 


Contents 


Remarks 
Oversubtraction on 1PR graphs 


Renormalization without regulators: the BPHZ scheme 


Minimal subtraction 

Definition 

MS renormalization 

Minimal subtraction with other regulators 


Composite operators 

Operator-product expansion 

Renormalization of composite operators: examples 
Renormalization of $7 

Renormalization of ¢7(x)¢7(y) 

Definitions 

Operator mixing 

Tensors and minimal subtraction 

Properties 

Differentiation with respect to parameters in Z 
Relation of renormalizations of ¢? and m? 


Renormalization group 

Change of renormalization prescription 
Change of parametrization 
Renormalization-prescription dependence 
Low-order examples 

Proof of RG invariance 
Renormalization-group equation 
Renormalization-group coefficients 

RG equation 

Solution 

Large-momentum behavior of Green’s functions 
Generalizations 

Varieties of high- and low-energy behavior 
Asymptotic freedom 

Maximum accuracy in an asymptotically free theory 
Fixed point theories 

Low-energy behavior of massless theory 
Leading logarithms, etc. 
Renormalization-group logarithms 
Non-renormalization-group logarithms 
Landau ghost 

Other theories 

Other renormalization prescriptions 
Dimensional transmutation 

Choice of cut-off procedure 


vii 


131 
131 
133 
135 
135 
137 
137 


138 
139 
142 
142 
145 
146 
149 
150 
152 
163 
166 


168 
169 
169 
171 
172 
176 
180 
180 
183 
184 
185 
187 
187 
187 
189 
190 
191 
193 
193 
196 
196 
197 
200 
203 
206 


Contents 


Example: ¢* self-energy 

RG coefficients 

Computation of g, and Z; asymptotically free case 
Accuracy needed for gy 

mo 

Non-asymptotically free case 

Computing renormalization factors using dimensional 
regularization 

Renormalization group for composite operators 


Large-mass expansion 

A model 

Power-counting 

Tree graphs 

Finite graphs with heavy loops 

Divergent one-loop graphs 

More than one loop 

General ideas 

Renormalization prescriptions with manifest decoupling 
Dominant regions 

Proof of decoupling 

Renormalization prescription R* with manifest decoupling 
Definition of R* 

IR finiteness of C*(T) 

Manifest decoupling for R* 

Decoupling theorem 

Renormalization-group analysis 

Sample calculation 

Accuracy 


Global symmetries 

Unbroken symmetry 
Spontaneously broken symmetry 
Proof of invariance of counterterms 
Renormalization of the current 
Infra-red divergences 
Renormalization methods 


Generation of renormalization conditions by Ward identities 


Operator-product expansion 
Examples 

Cases with no divergences 
Divergent example 
Momentum space 

Fig. 10.1.3 inside bigger graph 


208 
208 
209 
213 
216 
216 


217 
219 


222 
224 
225 
225 
226 
226 
229 
230 
231 
232 
233 
233 
236 
237 
239 
239 
240 
242 
243 


244 
245 
247 
249 
251 
252 
253 
254 


257 
258 
258 
260 


262 
262 


10.2 
10.3 
10.3.1 
10.3.2 
10.3.3 
10.3.4 
10.4 
10.5 


11 
11.1 
11.1.1 
11.1.2 
11.2 
11.3 
11.4 
12 
12.1 
12.2 
12.2.1 
12.2.2 
12.2.3 
12.2.4 
12.2.5 
12.2.6 
12.2.7 
12.2.8 
12.2.9 
12.3 
12.3.1 
12.3.2 
12.3.3 
12.4. 
12.4.1 
12.4.2 
12.5 
12.6 
12.6.1 
12:7 
12.8 
129 
12.9.1 
12.9.2 
12.9.3 


Contents 


Strategy of proof 

Proof 

Construction of remainder 

Absence of infra-red and ultra-violet divergences in r(T) 
R(T) —r(I) is the Wilson expansion 

Formula for C(U) 

General case 

Renormalization group 


Coordinate space 

Short-distance singularities of free propagator 
Zero temperature 

Non-zero temperature 

Construction of counterterms in low-order graphs 
Flat-space renormalization 

External fields 

Renormalization of gauge theories 

Statement of results 

Proof of renormalizability 

Preliminaries 

Choice of counterterms 

Graphs with external derivatives 

Graphs finite by equations of motion 

Gluon self-energy 

BRS transformation of A‘ 

Gluon self-interaction 

Ope 

Quark~—gluon interaction, dy, dy; introduction of 2, 
More general theories 

Bigger gauge group 

Scalar matter 

Spontaneous symmetry breaking 

Gauge dependence of counterterms 

Change of č 

Change of F, 

R,-gauge 

Renormalization of gauge-invariant operators 
Caveat 

Renormalization-group equation 
Operator-product expansion 

Abelian theories: with and without photon mass 
BRS treatment of massive photon 

Elementary treatment of abelian theory with photon mass 
Ward identities 


Contents 


Counterterms proportional to A? and ¢: A? 
Relation between e, and Z, 

Gauge dependence 

Renormalization-group equation 

Unitary gauge for massive photon 


Anomalies 

Chiral transformations 

Definition of y, 

Properties of axial currents 

Non-anomalous currents 

Anomalous currents 

Chiral gauge theories 

Supersymmetric theories 

Ward identity for bare axial current 
Renormalization of operators in Ward identities 
One-loop calculations 

Non-singlet axial current has no anomaly 
Reduction of anomaly 

Renormalized current has no anomalous dimension 
Three-current Ward identity: the triangle anomaly 
General form of anomaly 

One-loop value 

Higher orders 


Deep-inelastic scattering 

Kinematics, etc. 

Parton model 

Dispersion relations and moments 
Expansion for scalar current 

Calculation of Wilson coefficients 
Lowest-order Wilson coefficients 
Anomalous dimensions 

Solution of RG equation — non-singlet 
OPE for vector and axial currents 
Wilson coefficients — electromagnetic case 
Parton interpretation of Wilson expansion 
W, and W, 

References 


Index 


326 
327 
327 
329 
330 
331 
333 
334 
338 
338 
338 
339 
339 
339 
340 
342 
346 
346 
347 
349 
349 
351 
352 


354 
355 
358 
359 
361 
364 
364 
365 
367 
367 
368 
369 
371 
372 


377 


1 


Introduction 


The structure of a quantum field theory often simplifies when one considers 
processes involving large momenta or short distances. These simplifi- 
cations are important in improving one’s ability to calculate predictions 
from the theory, and in essence form the subject of this book. 

The first simplification to be considered involves the very existence of the 
theory. The problem is that there are usually ultra-violet divergences caused 
by large fluctuations of the field(s) on short distance scales. These manifest 
themselves in Feynman graphs as divergences when loop momenta go to 
infinity with the external momenta fixed. The simplification is that the 
divergences can be cancelled by renormalizations of the parameters of the 
action. Consequently our first task will be to treat the ultra-violet 
renormalizations. Renormalization is essential, for otherwise most field 
theories do not exist. 

We will then expose the methods needed to handle high-energy/short- 
distance problems. The aim is to be able to make testable predictions from a 
strong interaction theory, or to improve the rate of convergence of the 
perturbation expansion in a weakly coupled theory. The simplifications 
generally take the form of a factorization of a cross-section or of an 
amplitude, each factor containing the dependence of the process on 
phenomena that happen on one particular distance scale. Such a factori- 
zation is useful, because the coefficients of the perturbation expansion for a 
process are large when the process involves widely different distance scales. 

The industry called ‘perturbative QCD’ consists of deriving such 
factorization theorems for strong interactions (Mueller (1981)) and explor- 
ing their phenomenological consequences. We will only study the earliest of 
these factorizations, the operator product expansion of Wilson (1969). We 
will also discuss the theorems that describe the behavior of a theory when 
the masses of its fields get large (Appelquist & Carazzone (1975) and Witten 
(1976)). These large-mass theorems have their main uses in weak in- 
teraction theories. 

The presence of ultra-violet divergences, even though they are cancelled 
by renormalization counterterms, means that in any process there are 
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contributions from quantum fluctuations on every distance scale. This is 
both a complication and an opportunity to find interesting physics. The 
complication is that the derivation of factorization theorems is made 
difficult. The opportunity is given by the observable phenomena that 
directly result from the existence of the divergences. A standard example is 
given by the scaling violations in deep-inelastic scattering. 

It is the renormalization group (Stueckelberg & Petermann (1953) and 
Gell-Mann & Low (1954)) that is the key technique in disentangling the 
complications. The infinite parts of the counterterms are determined by the 
requirement that they cancel the divergences, but the finite parts are not so 
determined. In fact, the partition of a bare coupling g, into the sum of a 
finite renormalized coupling gg and a singular counterterm Ag is arbitrary. 
One can reparametrize the theory by transferring a finite amount from gp to 
Ag without changing the physics: the theory is renormalization-group 
invariant. 

This trivial-sounding observation is in practice very useful, and far from 
trivial. Suppose one has some graph whose renormalized value is large (so 
that it is inadequate to use a few low orders of the perturbation theory to 
compute the corresponding quantity). Then in appropriate circumstances it 
is possible to adjust the partition of gy (viz., go = gr + Ag) so that the 
counterterm Ag cancels not only the divergence but also the excessively 
large piece of the graph’s finite part. The large piece is now in the lowest 
order instead of higher orders. Construction of factorization theorems of 
the sort reviewed by Mueller (1981) provides many circumstances where 
this trick is applicable. Without it the factorization theorems would be 
almost powerless. 

We see that the subjects of renormalization, the renormalization group, 
and the operator product expansion are intimately linked, and we will treat 
them all in this book. The aim will be to explain the general methods that 
are applicable not only to the examples we will examine but in many other 
situations. We will not aim at complete rigor. However there are many 
pitfalls and traps ready to ensnare an unwary physicist. Thus a precise set of 
concepts and notations is necessary, for many of the dangers are essentially 
combinatorial. The appropriate basis is then that of Zimmermann (1969, 
1970, 1973a, 1973b). 

One other problem is that of choice of an ultra-violet cut off. From a 
fundamental point of view, the lattice cut-off seems best as it appears in non- 
perturbative treatments using the functional integral (e.g., Glimm & Jaffe 
(1981)). In perturbation theory one can arrange to use no regulator 
whatsoever (e.g., Piguet & Rouet (1981)). In practice, dimensional reg- 
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ularization has deservedly become very popular. This consists of replacing 
the physical space-time dimensionality 4 by an arbitrary complex number d. 
The main attraction of this method is that virtually no violence is done to 
the structure of a Feynman graph; a second attraction is that it also 
regulates infra-red divergences. The disadvantage is that the method has 
not been formulated outside of perturbation theory (at least not yet). Much 
of the treatment in this book, especially the examples, will be based on the 
use of dimensional regularization. However it cannot be emphasized too 
strongly that none of the fundamental results depend on this choice. 
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Quantum field theory 


Most of the work in this book will be strictly perturbative. However it is 
important not to consider perturbation theory as the be-all and end-all of 
field theory. Rather, it must be looked on only as a systematic method of 
approximating a complete quantum field theory, with the errors under 
control. So in this chapter we will review the foundations of quantum field 
theory starting from the functional integral. 

The purpose of this review is partly to set out the results on which the rest 
of the book is based. It will also introduce our notation. We will also list a 
number of standard field theories which will be used throughout the book. 
Some examples are physical theories of the real world; others are simpler 
theories whose only purpose will be to illustrate methods in the absence of 
complications. 

The use of functional integration is not absolutely essential. Its use is to 
provide a systematic basis for the rest of our work: the functional integral 
gives an explicit solution of any given field theory. Our task will be to 
investigate a certain class of properties of the solution. 

For more details the reader should consult a standard textbook on field 
theory. Of these, probably the most complete and up-to-date is by Itzykson & 
Zuber (1980); this includes a treatment of the functional integral method. 
Other useful references include: Bjorken & Drell (1966), Bogoliubov & 
Shirkov (1980), Lurie (1968), and Ramond (1981). 


2.1 Scalar field theory 


The simplest quantum field theory is that of a single real scalar field ¢(x*). 
The theory is defined by canonically quantizing a classical field theory. This 
classical theory is specified by a Lagrangian density: 


L =(0¢)*/2 — P(d), (2.1.1) 
from which follows the equation of motion 
Lid + P(g) =0. (2.1.2) 
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2.1 Scalar field theory 5 


Here P(¢ġ) is a function of @(x), which we generally take to be a polynomial 
like P(ġ) = m?$7/2 + go*/4!, and P'(ġ) =dP/d@. (Note that we use units 
with h=c=1.) 

In the Hamiltonian formulation of the same theory, we define a canonical 
momentum field: 


n(x) = 0L£/db = ¢ = 0g/ét, (2.1.3) 


and the Hamiltonian | 
H= [axana + Vo2/2 + P(¢)). (2.1.4) 


Physically, we require that a theory have a lowest energy state. If it does not 
then all states are unstable against decay into a lower energy state plus a 
collection of particles. If the function P(@) has no minimum, then the 
formula (2.1.4) implies that just such a catastrophic situation exists (Baym 
(1960)). Thus we require the function P(ġ) to be bounded below. 

Quantization proceeds in the Heisenberg picture by reinterpreting (x) 
as a hermitian operator on a Hilbert space satisfying the canonical equal- 
time commutation relations, i.e., 


[ x(x), (y)] = — 16% — 9), l ‘fF 0,0 2.1.5 
mosolon- A (2.1.5) 


The Hamiltonian is still given by (2.1.4) so the equation of motion (2.1.2) 
follows from the Heisenberg equation of motion 


iôġ/ôt = [¢ġ, H]. (2.1.6) 

A solution to the theory is specified by stating what the space of states is 

and by giving the manner in which ¢ acts on the states. We will construct a 

solution by use of the functional integral. It should be noted that (x) 1s in 

general not a well-behaved operator, but rather it is an operator-valued 

distribution. Physically that means that one cannot measure ġ(x)at a single 
point, but only averages of d(x) over a space-time region. That is, 


os = | b(x) f (x)d*x, (2.1.7) 


for any complex-valued function f(x), is an operator. Now, products of 
distributions do not always make sense (e.g., 6(x)?). In particular, the 
Hamiltonian H involves products of fields at the same point. Some care is 
needed to define these products properly; this is, in fact, the subject of 
renormalization, to be treated shortly. 

The following properties of the theory are standard: 
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(1) The theory has a Poincaré-invariant ground state |O>, called the 
vacuum. 

(2) The states and the action of ¢ on them can be reconstructed from the 
time-ordered Green’s functions 


Gy(X15---sXy) = €0| T(x)... A(Xp)/0. (2.1.8) 


The T-ordering symbol means that the fields are written in order of 
increasing time from right to left. 

(3) The Green’s functions have appropriate causality properties, etc., so 
that they are the Green’s functions of a physically sensible theory. 
Mathematically, these properties are summarized by the Wightman 
axioms (Streater & Wightman (1978)). 


Bose symmetry of the ¢-field means that the Green’s functions are 
symmetric under interchange of any of the x’s. From the equations of 
motion of ġ and from the commutation relations can be derived equations 
of motion for the Green’s functions. The simplest example is 


1,G2(y, x) + <0| TPAO) = —i5(x—y). (2.1.9) 


For a general (N + 1)-point Green’s function, we have N 6-functions on the 
right: 


Ty Gu 4 XXn) + CO TP'(P(y)) (x1). -- (Xn) | 


N 
= —i $, OMY — x)Gy_ (045-2. Xj 45 Xto Xy) (2.1.10) 
j=1 


This equation summarizes both the equations of motion and the com- 
mutation relations. Solving the theory for the Green’s functions means in 
essence solving this set of coupled equations. It is in fact the Green’s 
functions that are the easiest objects to compute. All other properties of the 
theory can be calculated once the Green’s functions are known. 


2.2 Functional-integral solution 


The solution of a quantum field theory is a non-trivial problem in 
consistency. Only two cases are elementary: free field theory (P = m*7/2), 
and the case of one space-time dimension, d = 1. The case d = 1 is a rather 
trivial field theory, for it is just the quantum mechanics of a particle with 
Heisenberg position operator @(t) in a potential P(@). (In Section 2.1, we 
explained the case d = 4. It is easy to go back and change the formulae to be 
valid for a general value of d.) 
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For the case of ¢* theory, with 


P(d) =m p?/2 + 99*/4!, (2.2.1) 
solutions are rigorously known to exist if d = 2 or 3 (Glimm & Jaffe (1981)). 
If d > 4 then no non-trivial solution exists (Aizenman (1981)). The case d = 4 
is difficult; the difficulty is to perform renormalization of the ultra-violet 
divergences beyond perturbation theory. As we will see the theory at d = 4 
is ‘exactly renormalizable’ in perturbation theory; this is the most 
interesting case. For the most part we will ignore the difficulties in going 
beyond perturbation theory. We will return to this problem in Section 7.10 
when we discuss the application of the renormalization group outside of 
perturbation theory. 
If we ignore, temporarily, the renormalization problem, then a solution 
for the theory can be found in terms of a functional integral. The formula for 
the Green’s functions is written as 


CA rete se Loe, [foa]e40s,) ..A(Xy). (2.2.2) 


(See Chapter 9 of Itzykson & Zuber (1980), or see Glimm & Jaffe (1981).) On 
the right-hand side of this equation A(x) represents a classical field, and the 
integration is over the value of A(x) at every space-time point. The result of 
the integral in (2.2.2) is the N-point Green’s function for the corresponding 
quantum field, ¢. In the integrand appears the classical action, which is 


S[A] = | ELA (2.2.3) 


The normalization factor M is to give <0|0> = 1, so that 


N= | | jaaa] (2.2.4) 


Equivalent to (2.2.2) is the integral for the generating functional of 
Green’s functions: 


Z[J]= v | [dA]exp fista] re | atxa, (2.2.5) 


where J(x)is an arbitrary function. Functionally differentiating with respect 
to J(x) gives the Green’s functions, e.g., 
(2.2.6) 


1 ô? 
COTA = 3709 zarg 


It is somewhat delicate to make precise the definition of the integration 
over A. The principal steps are: 


(J =0)- 
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(1) ‘Wick-rotate’ time to imaginary values: t = — it, so that space-time is 
Euclidean. The exponent in the integral is then: 


— Smala] = — | dt d?x[ — 42/2 + P(A)]. (2.2.7) 


With our metric, we have 0A? = — (0A/dt)? — VA”. We may subtract 
out from Z the minimum value of P(A); this subtraction gives an 
overall factor in the functional integral, and it cancels between the 
integral and the normalization factor (2.2.4). Therefore the Euclidean 
action S,,,., IS positive definite. The factor exp(— Stuc) gives much 
better convergence for large A and for rapidly varying A than does 
exp (iS) in Minkowski space. 

(2) Replace space-time by a finite lattice. We may choose a cubic lattice 
with spacing a. Its points are then 


x“ = n"a. 
where the n”’s are integers. They are bounded to keep x inside a spatial 


box of volume V and to keep t within a range — T/2 to + T/2. The 
integral 


| | LEE E (2.2.8) 


is now an absolutely convergent ordinary integral over a finite number 
of variables. The action S,,., is given its obvious discrete 
approximation. 

(3) Take the continuum limit a —> 0, and the limits of infinite volume V and 
infinite time T. 

(4) Analytically continue back to Minkowski space-time. 


The difficulties occur at step 3. Taking the limits of infinite T and V gives 
divergences of exactly the sort associated with taking the thermodynamic 
limit of a partition function — see below. Further divergences occur when 
the continuum limit a —> 0 is taken. In addition, the canonical derivation of 
(2.2.2) gives an overall normalization factor which goes to infinity as a > 0 
or as the number of space-time points goes to infinity; this factor is 
absorbed by the normalization M~. 

The limits of infinite volume and time are under good control. They are 
literally thermodynamic limits of a classical statistical mechanical system in 
four spatial dimensions. Recall, for example, that in ¢* theory one can 
write © 


S[4] =g"! | d*x(0A?/2 — m?A?/2 — A*/4!) (2.2.9) 


2.3 Renormalization 9 
where A =g'?A. Thus the integral {[d4]exp(— S[A]) is proportional to 
fraa exp { —(1/g)S[A > A,g > 1]}. (2.2.10) 


This is the partition function of a classical system at temperature 1/g, when 
the phase space is spanned by the field A, and when the energy of a given 
configuration is 


Sesa 4] = fax — 0A?/2 + m?A?2/2 + A4/4!). 


The identity between Euclidean field theory and certain classical statistical 
mechanics systems has been fruitful both in working out the rigorous 
mathematical treatment of quantum field theory (Glimm & Jaffe (1981)) 
and in finding new ways to treat thermodynamic problems (Wilson & 
Kogut (1974)). As is particularly emphasized in Wilson’s work, there is a lot 
of cross-fertilization between field theory and the theory of phase tran- 
sitions. The methods of the renormalization group are common to both 
fields, and the continuum limit in field theory can be usefully regarded as a 
particular type of second-order phase transition. 

The thermodynamic limit gives a factor exp(— pT V), where p is the 
ground state energy-density. This factor is clearly cancelled by M. All the 
remaining divergences are associated with the continuum limit a —> 0. These 
are the divergences that form the subject of renormalization. They are 
called the ultra-violet (U V) divergences. 

One notational change needs to be made now. In more complicated 
theories, there will be several fields, and the functional-integral solution of 
such a theory involves an integral over the values of a classical field for each 
quantum field. It is convenient to have a symbol for each classical field that 
is clearly related to the corresponding quantum field. The standard 
notation is to use the same symbol. Thus we change the integration variable 
in (2.2.2) from A(x) to (x), with the result that 


(0|TO(x,).--$(%y)]0> =N | [dpe (x,)...h(xy). (2.2.11) 


This is somewhat of an abuse of notation. However, it is usually obvious 
whether one is using ¢ to mean the quantum field, as on the left-hand side, 
or to mean the corresponding classical field, as on the right-hand side. 


2.3 Renormalization 


The difficult limit is the continuum limit a —> 0. There are divergences in this 
limit; this has been known from the earliest days of quantum elec- 
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trodynamics (e.g., Oppenheimer (1930)). It is possible to say that the UV 
divergences mean that the theory makes no physical sense, and that the 
subject of interacting quantum field theories is full of nonsense (Dirac 
(1981)). Luckily we can do better, for our ultimate aim need not be to 
construct a field theory literally satisfying (2.1.2)—(2.1.5). Rather, our aim is 
to construct a relativistic quantum theory with a local field as its basic 
observable. These requirements are satisfied if we construct a collection of 
Green’s functions satisfying sensible physical properties (for example, as 
formulated in the Osterwalder—Schrader axioms — see Glimm & Jaffe 
(1981)). We may further ask that we find a theory that is close in some sense 
to satisfying the defining equations (2.1.2)-(2.1.5). Combining the func- 
tional integral with suitable renormalizations of the parameters of the 
theory satisfies these requirements. 

The basic idea of renormalization comes from the observation that in 
one-loop graphs the divergences amount to shifts in the parameters of the 
action. For example, they change the mass of the particles described by ġ(x) 
from the value m to some other effective value, which is infinite if m is finite. 
Renormalization is then the procedure of cancelling the divergences by 
adjusting the parameters in the action. To be precise, let us consider the oł 
theory with 

L =(0A,)*/2 — mo Ag/2 — gAG/4! + Ao. (2.3.1) 
The subscript zero is here used to indicate so-called bare quantities, i.e., 
those that appear in the Lagrangian when the (0A,)7/2 term has unit 
coefficient. (We also introduce a constant term. It will be used to cancel a 
UV divergence in the energy density of the vacuum.) Then we rescale the 
field by writing 

Ay = Z'7A, (2.3.2) 

so that, in terms of the ‘renormalized field’ A, the Lagrangian is 

L =Z0A?/2— meZA?/2—9,Z7A*/4! 

= Z0A?7/2 — mg A*/2 — gg A*/4!. (2.3.3) 
We have dropped A, from £ since it has no effect on the Green’s functions. 

The Green’s functions of the quantum field ¢ are now obtained by using 
(2.3.3) as the Lagrangian in the functional integral (2.2.2). We let Z, mọ, and 
go be functions of the lattice spacing a, and we choose these functions (if 
possible) so that the Green’s functions of ¢ are finite as a— 0. If this can be 


done, then we have succeeded in constructing a continuum field theory, and 
it is termed ‘renormalizable’. The theory may be considered close to solving 
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(2.1.2)-(2.1.5). This is because the theory is obtained by taking a discrete 
(i.e., lattice) version of the equations and then taking a somewhat odd 
continuum limit. 

We will call mọ the bare mass, and g, the bare coupling, and we will call Z 
the wave-function, or field-strength, renormalization. It is also common to 
call m, and g, the bare mass and coupling; but for the sake of consistency 
we will not do this in this book. 

Another way of viewing the renormalization is to write (2.3.3) as 


L = 0A*/2 — m? A?/2 — gA*/4! 
+ §Z0A2/2 — ôm? A2/2 — 5g A*/Al. (2.3.4) 


We will call the first three terms the basic Lagrangian and the last three the 
counterterm Lagrangian. The renormalized mass m and the renormalized 
coupling g are finite quantities held fixed as a0. The counterterms 
ôZ = Z — 1, ĝm? = m? —m’, and ôg = gp — g are adjusted to cancel the 
divergences as a—>0. This form of the Lagrangian is useful in doing 
perturbation theory; we treat 04?/2 — m? A?/2 as the free Lagrangian and 
the remainder as interaction. The expansion is in powers of the re- 
normalized coupling g. The counterterms are expanded in infinite series, 
each term cancelling the divergences of one specific graph. 

The form (2.3.4) for Z also exhibits the fact that the theory has two 
independent parameters, m and g. The counterterms are functions of m, g, 
and of a. 

We will discuss these issues in much greater depth in the succeeding 
chapters. For the moment it is important to grasp the basic ideas: 


(1) The self-interactions of the field create, among other things, dynamical 
contributions to the mass of the particle, to the potential between 
particles, and to the coupling of the field to the single particle state. Thus 
the measured values of these parameters are renormalized relative to 
the values appearing in the Lagrangian. 

(2) These contributions, or renormalizations, are infinite, in many cases. 
The most important theorem of renormalization theory is that they are 
the only infinities, in the class of theories called ‘renormalizable’. 

(3) The infinities are cancelled by wave-function, mass, and coupling 
counterterms, so that the net effect of the interactions is finite. 

(4) To make quantitative the sizes of the infinities, the theory is constructed 
as the continuum limit of a lattice theory. The infinities appear as 
divergences when the lattice spacing goes to zero. 
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2.4 Ultra-violet regulators 


In the last sections we showed how to construct field theories by defining 
the functional integral as the continuum limit of a lattice theory. Ultra- 
violet divergences appear as divergences when the lattice spacing, a, goes to 
zero, and are removed by renormalization counterterms. The lattice 
therefore is a regulator, or cut-off, for the UV divergences. 

To be able to discuss the divergences quantitively and to construct a 
theory involving infinite renormalizations, it is necessary to use some kind 
of UV cut-off. Then the theory is obtained as an appropriate limit when the 
cut-off is removed. There are many possible ways of introducing a cut-off, of 
which going to a lattice is only one example. The lattice appears to be very 
natural when working with the functional integral. But it is cumbersome to 
use within perturbation theory, especially because of the loss of Poincaré 
invariance. There are two other very standard methods of making an ultra- 
violet cut-off: the Pauli—Villars method, and dimensional regularization. 

The Pauli—Villars (1949) method is very traditional. In its simplest 
version it consists of replacing the free propagator i/(p* — m?) in a scalar 
field theory by 

l i 
p-m p-M? 
. 2 2 
DEE SEE (2.4.1) 
(p — m°) (p — M°) 
As M —> œ, this approaches the original propagator. The behavior for large 
p has clearly been improved. Thus the degree of divergence of the Feynman 
graphs in the theory has been reduced. All graphs in the ¢* theory, except 
for the one-loop self-energy are in fact made finite. In the ġ4 theory it is 
necessary to use a more general form in order to make all graphs finite: 
i i (m? — M3) 
(p?—m?) (p* — Mj) (Mj — M3) 
i (m? — M?) 
(p? — M3) (M3 — M3) 
i (M?—m?) (M3 — m?) 
(p? — m°) (p? — Mj) (p° — M3)’ (2.4.2) 


S,;(p,m; M) = 


S-(p,m;M,,M,)= 


It is usually convenient to set M, = M). 

Now the regulated propagator has extra poles at p? = M?, or at p? = Mj 
and p? = M2. Since one of the extra poles has a residue of the opposite sign 
to the pole at p? = m?, the regulated theory cannot be completely physical. 
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It is normally true that a theory with an ultra-violet cut-off has some 
unphysical features. 

Perhaps the most convenient regulator for practical calculations is 
dimensional regularization. There it is observed that the UV divergences 
are removed by going to a low enough space-time dimension d, so d is 
treated as a continuous variable. In perturbation theory this can be done 
consistently (Wilson (1973)), as we will see when we give a full treatment of 
dimensional regularization in Chapter 4. However it has not been possible 
to make it work non-perturbatively, so it cannot at present be regarded asa 
fundamental method. 

Since it is only the renormalized theory with no cut-off that is of true 
interest, the precise method of cut-off is irrelevant. In fact, all methods of 
ultra-violet cut-off are equivalent, at least in perturbation theory. The 
differences are mainly a matter of practical convenience (or of personal 
taste). Thus dimensional regularization is very useful for perturbation 
theory. But the lattice method is maybe most powerful when working 
beyond perturbation theory; it is possible, for example, to compute the 
functional integral numerically by Monte-Carlo methods (Creutz (1980, 
1983), and Creutz & Moriarty (1982)). 

Within perturbation theory one need not even use a cut-off. 
Zimmermann (1970, 1973a) has shown how to apply the renormalization 
procedure to the integrands rather than to the integrals for Feynman 
graphs. The lack of fundamental dependence on the procedure of cut-off is 
thereby made manifest. The application of this procedure to gauge theories, 
especially, is regarded by most people as cumbersome. 


2.5 Equations of motion for Green’s functions 


We have defined a collection of Green’s functions by the functional integral 
(2.2.2). (Implicit in the definition are a certain number of limiting 
procedures, as listed below (2.2.6).) This definition we will take as the basis 
for the rest of our work. First we must check that it in fact gives a solution of 
the theory. This means, in particular, that we are to derive the equations of 
motion (2.1.10) for the Green’s functions, thus ensuring that both the 
operator equation of motion (2.1.2) and the commutation relations (2.1.5) 
hold. (For the remainder of this chapter we will not specify the details of 
how renormalization affects these results.) | 

It is convenient to work with the generating functional (2.2.5). We make 
the change of variable A(x) A(x) + ef (x), where ¢ is a small number, and 
f(x)is an arbitrary function of x“. Since the integration measure is invariant 
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under this shift, the value of the integral is unchanged: 


[toA]exp} ist +ef]+ fis tenu} [[aslex fista] + fast. 
(2.5.1) 


Picking out the terms of order € gives 


fesso [fa}exp4ista) + [au fi a +40 |=0 , (2.5.2) 


where, as usual, we define the functional derivative 
os dP 


7 ee (2.5.3) 


Since f(y) is arbitrary, we get 


| [d4]exp fista ie | a} li O l =~0. (2.5.4) 


Functionally differentiating N times with respect to J, followed by setting 
J =0, - the equation of motion (2.1.10). For example, 


0=V [left-hand side of (2.5.4) ],_, 


Te 


=ar fraes acis Al F TE I(x = | 


= NW | [dA] {1 A(x) [ — DA(y) — P'(A(y))] + 5 — y)} 
=. | [dA Je!" 4(x) A(y) 


—iN | [d4]e414(x)P'(A(y)) + 8x y) 
= —i07,<0] Fa (y)|0> = i <O TAPPY) + 54x — y) 


=i<0|T¢(x)———|0) + 64x — y), (2.5.5) 


Te ) 
which is equivalent to (2.1.9). Note that in the fourth line we have exchanged 
the order of integration and of differentiation for the [1], term. We have also 
used the normalization condition (2.2.4). It is important that the derivative 
of the quantum field (next-to-last line) is outside the time ordering, and 
6S/d@(y) in the last line is defined to be a shorthand for the combination of 
Operators in the previous line. 
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This is somewhat paradoxical since we have the operator equation of 
motion: 


=< = -0¢-P'($), 


from which it is tempting to deduce that the Green’s function | 
<0| To(x)dS/5¢(y)|0> should be zero. However, in view of the work above 
it is convenient to define this Green’s function by the functional-integral 
formula 


ôS 
ôA(y) 


Then, as we have seen, the , is implicitly outside the time-ordering. 
Bringing it inside the time-ordering gives a commutator, so that we get the 
6-function term in (2.1.9) or (2.5.5). 

The momentum-space version of the equation of motion (2.1.10) is often 
useful. We define the momentum-space Green’s functions 


ôS | 
<0| To ggl” = W [jaaa 


Gyl Pis- --, Py) = [asx ...dtxy exp {ilp xi +°°* + Py’ Xy)}Gy(X1,---5 Xy) 


= Gy(P1,---sPy)(2m)"6(p, +: + pr). (2.5.6) 


The momenta p, are to be regarded as flowing out of the Green’s functions. 
Translation invariance of the theory implies the 6-function for momentum 
conservation that is explicitly factored out in the last line of (2.5.6). A 
convenient notation (which we will use often) is to write 


Gy(P15++s Pn) = <0| TO(P1)-.- A(Py)|0. (2.5.7) 


Implicit in this formula is the definition that the integrals over x defining the 
momentum-space field (p) are all taken outside the time-ordering, as 
stated in (2.5.6). We will use a tilde over the symbol for any function to 
indicate the Fourier-transformed function. 

Fourier transformation of the equation of motion (2.1.10) gives 


ar q? <0| T 6(q)0(P,). .$(Py)|O> 
+ <0|TP'($)(q)b(p;)...4(py)|9> 


= —i È ¢0|TH(py).--B(P;-1)B(Pj+.1)---B(Py)|0> CIA + p). (2.5.8) 


2.6 Symmetries 


We now turn to the consequences of symmetries. As we will see, there are 
many interesting problems in renormalization theory that stem from the 


16 Quantum field theory 


following question: If a classical field theory has certain symmetries, does 
the symmetry survive after quantization? Generally, it is the need for 
renormalization of the theory that makes this a non-trivial question. 

The symmetry properties are expressed in terms of Green’s functions by 
the Ward identities. (Historically the earliest example was found by Ward 
(1950) in QED.) If the symmetry is not preserved by quantization there are 
extra terms called anomalies. In many cases there are no anomalies, so we 
will derive the Ward identities in this section ignoring the subtleties that in 
some cases lead to anomalies. Discussion of anomalous cases is given in 
Chapter 13. 

Consider a theory of N fields which we collectively denote by a vector 
$ =(¢,,---> Py). Our discussion is general enough to include the case of 
fields with spin. We consider a symmetry group of the action S[@]. This isa 
group of transformations on the classical fields 


¢—F[¢;0]=¢’, (2.6.1) 
which leaves the action invariant: | 
S[o’] = slo]. (2.6.2) 


Here œw = (w°) is a set of parameters of the group, which we assume here to 
be a Lie group, i.e., the w’s take on a continuous set of values. We let w = 0 
be the identity: F[@;0]=@. It is easiest to work with infinitesimal 
transformations: 


OF. 
. = wœ — fo: =g. : 2.6. 
i= ilgo]  =0°,¢, 2.6.3) 


(A summation convention on « is understood.) 
In the quantum theory the symmetry is implemented as a unitary 
representation U(w) of the group on the Hilbert space of states such that 


U(w)@U() t = F[¢; a]. (2.6.4) 
Since the representation is unitary, we may parametrize the group so that 
U(m) = exp (iw"Q,), (2.6.5) 


where the generators Q, are hermitian operators which represent the Lie 
algebra of the group: 


[Q,, Q;] a 1C,pyQ,- (2.6.6) 


The normalizations are such that the structure constants c,,, are totally 
antisymmetric. The infinitesimal transformations are then given by: 


6,6(x) = iLQ,, &;(x) ]. (2.6.7) 


(1) 


(2) 


(3) 


(4) 


(5) 


2.6 Symmetries 17 


There are a number of special cases, each with its own special features: 


Global internal symmetry: A finite-dimensional Lie group acts on the 
fields at each point of space-time, with the same transformation at each 
point. Thus 


5.6; = — ilt) D; (2.6.8) 
where the t, form a hermitian matrix representation of the Lie algebra: 
[taste] =ic,,,t,- (2.6.9) 


Single-particle states carrying this representation are annihilated by ġ;. 
The Lagrangian is invariant. 

Global space-time symmetry : The group effectively is a transformation 
on space-time; the Poincare group and its extensions are the usual 
cases. For a Poincaré transformation x*—A*4x’+ a", we have the 
corresponding transformation of the fields: 


b(x) > (Ax + a)R(A));. (2.6.10) 


Here R is a finite-dimensional matrix representation of the Lorentz 


group (never unitary if non-trivial), acting on the spin indices of @. The 


Lagrangian is not invariant. It transforms as 
Lip, x] > £[¢,Ax +a], 


so that the action S = /d*x# is invariant. Infinitesimal transformations 
of @ involve the derivative of œ. 

Global chiral symmetry: This looks like a global internal symmetry but 
acts differently on the left- and right-handed parts of Dirac fields (which 
we have yet to discuss). Anomalies are often present — see Chapter 13. 
Supersymmetry: This is a generalized type of symmetry where Bose and 
Fermi fields are related (Fayet & Ferrara (1977)). The only case that we 
will discuss is the BRS-invariance (Becchi, Rouet & Stora (1975)) of a 
gauge theory. 

Gauge, or local, symmetry: Any of the above symmetries may be 
extended to a symmetry whose parameters depend on x:w = w(x). In 
quantum theories, these are not really implemented by unitary 
transformations. Their treatment is rather special. The elementary 
examples are general coordinate invariance in General Relativity, and 
gauge invariance in electromagnetism. 


The basic tool for discussing symmetries is Noether’s theorem, which 


relates them to conservation laws. This theorem in its most straightforward 
form applies only to symmetries of the first three types. 
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For a global symmetry, Noether’s theorem asserts that a conserved 
current j“ exists for each generator of a symmetry. Let the Lagrangian have 
the infinitesimal transformation 


LSL FOOL =f roo, y 7, (2.6.11) 
so that the action is invariant. Define 
OL 
S rE (2.6.12 
AETS ) 


Then the equations of motion imply conservation of j#, i.e., 
0jt/ox* =Q. (2.6.13) 


The generators of the symmetry group are 
Q,= |d°xj?. (2.6.14) 


The canonical commutation relations imply that 
Lie(x), by) ] = — 16, 6(%)6 (x — y), (if x° = y’), 
[Q p:y)] on 10,0 ,(y), (2.6.15) 


as required by (2.6.7). 

We need to consider not only transformations that are symmetries of the 
quantum theory, but also ‘broken symmetries’. There are several cases (not 
mutually exclusive). Let us define them, since there is a certain amount of 
confusion in the literature about the terminology: 


(1) Explicit breaking: The classical action has a non-invariant term. If 
0,2 =0,Y'+A, then the Noether currents are not conserved: 
„jk = A, An important case is where this term is small, so that it can be 
treated as a perturbation. 

(2) Anomalous breaking: Even though the classical action is invariant, the 
quantum theory is not, and there is no conserved current. The classical 
action is important for the quantum theory, since it appears in the 
functional integral defining the theory. The cause of anomalous 
breaking is generally an ultra-violet problem: 0, jf # 0 in the UV cut-off 
theory, and the non-conservation does not disappear when the cut-off 1s 
removed. (Cases are conformal transformations and some chiral 
theories.) 

(3) Spontaneous breaking: The action is invariant and the currents are 
conserved (in the quantum theory), but the vacuum is not invariant 
under the transformations. 


Whether or not a symmetry is broken either spontaneously or anom- 
alously is a dynamical question. That is, one must solve the theory, at least 
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partially, to find the answer. Frequently, perturbation theory is adequate to 
do this and lowest order or next-to-lowest order calculations suffice. 
Renormalization is an integral part of treating anomalous breaking (see 
Chapter 13), while renormalization-group methods are sometimes neces- 
sary in treating spontaneously broken symmetry (Coleman & Weinberg 
(1973)). 

The case of spontaneous symmetry breaking that is not visible in 
perturbation theory is often termed dynamical (Jackiw & Johnson (1973), 
Cornwall & Norton (1973), and Gross (1976)). Anomalous breaking is 
sometimes called spontaneous, but this is a bad terminology, because it 
gives two very different phenomena the same name. 


2.7 Ward identities 


Ward identities express in terms of Green’s functions the consequences of a 
symmetry (whether or not it is broken). One derivation applies the equation 
of motion (2.1.10) to the divergence of a Green’s function of the current j*. 
There are two terms: one in which the current is differentiated, and one in 
which the 6-functions defining the time-ordered product are differentiated. 
Thus a Ward identity expresses not only conservation of its current but also 
the commutation relation (2.6.15), which is equivalent to the transfor- 
mation law. The Ward identities are central to a discussion of the 
renormalization of a theory with symmetries, expecially if spontaneously 
broken. 
Our derivation of Ward identities begins by making the following change 
of variable: 
Aix) > A(x) + f(x)5,4,() (2.7.1) 


in the functional integral for the generating functional Z[ J]. Here 6,4; is, as 
before, the variation of the field 4; under a symmetry transformation, and 
f(x) is a set of arbitrary complex-valued functions that vanish rapidly as 
x— ©. We get 


Z[J] = | [dA] exp {iS[A +76,A] + JA, +f76,4,)}. (2.7.2) 


(Here we assumed that the measure is invariant under the change of 
variables (2.7.1).) The terms in (2.7.2) that are linear in f* give 


0= {teal exp iS + pa) — dS[A + f76,A ]/6 f(y) + iJ'6,4,(y)} 


= | [dA] exp (is+ pahe, jt, gly) + iJ'5,A(y)}. (2.7.3) 


Here jt, , is the Noether current in the classical theory. 
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The Ward identities follow by functionally differentiating with respect to 
the sources J(x). Thus one differentiation gives 


Ô . 
gyr OTRO 0> = = i8 = yO), (2.74) 


while a double differentiation gives 


5 601TH (y)(w) (x) |0> 


— 1d (w — y) <0| T8 p:i) (X)|0> 
15 (x — y) 0| Th,(v)5,6,(9)|0D. (2.7.5) 
Note that, just as in our derivation of the equation of motion for Green’s 


functions in Section 2.5, the derivative 0/0)" is outside the time-ordering. 
The a case is: 


COT (y) i db, |X;)|O> 


= —i z Hy =X; )<OIT, Pn V) Il P(X x;)|0>. (2.7.6) 


ix j 
Important consequences of these Ward identities are obtained by 
integrating over all ý (with y° fixed). The spatial derivatives give a surface 
term, which vanishes, so that we have, for example, 


0 
| dP 3456 0|TI9)4, (00 = — ið? — 390| 8,4010): 


The spatial integral of j° is just the charge Q°. The time derivative acts either 
on the charge or on the 6-functions defining the time-ordering; so we find 
that 


¢0| TdQ,/dt h{x)|0> + (0|[Q,, (x) ]]0>d(x° — y’) 
= —id(x° — y°)¢0|6,6,(y)|0>. (2.7.7) 
In this equation and its generalizations from (2.7.6), we may choose the 
times of the fields ¢,(x,) not to coincide with y°. Therefore an arbitrary 
Green’s function of dQ, /dt is zero, so that the operator dQ_/dt is zero. The 
remaining part of (2.7.7) therefore gives: 


COJE. b:(x) ]]0> = —i¢ 0] 5, 9,(y)|0>. (2.7.8) 
From (2.7.8) and its generalizations with more fields, we find that the Q,’s 
have the correct commutation relations with the elementary fields ¢, to be 


the generators of the symmetry group. 
Finally, another specialization of (2.7.6) is to integrate it over all y“ and to 
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drop the resulting surface term. The result is that 


0= Ð <0|T6,¢,,(x) TI 4,,(x)|0> 
j=1 ifj 


=8,<0|T I] $,,(x)|0>. (2.7.9) 


All the above equations are true for the case of a completely unbroken 
symmetry. The derivation breaks down at the first step if we have 
anomalous breaking. (In Chapter 13 we will discuss the anomaly terms that 
must then be inserted in the Ward identities to make them correct.) For an 
explicitly broken symmetry, where 0: j = A#0, we must add a term 


N 
¢0|TA,(y) II Pn: (x:)|0> (2.7.10) 


to the right-hand side of (2.7.6). 

In the case of a spontaneously broken theory the basic Ward identities 
(2.7.4)-(2.7.6) remain true -— we still have an exact symmetry. But the 
integrated Ward identities (2.7.7) and (2.7.9) are no longer true. Equation 
(2.7.9) must be false if the vacuum is not invariant, and the derivation fails 
because the surface term is not zero. This is caused by the existence of zero- 
mass particles. These Nambu-Goldstone bosons (Goldstone, Salam & 
Weinberg (1962)) are characteristic of theories with a spontaneously 
broken symmetry. 
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As an example, consider again ¢* theory, with classical Lagrangian 
L = Z(0A)*/2 — mp A?/2 — g,A*/4!. (2.8.1) 
We will expand the Green’s functions in powers of the renormalized 


coupling g, for small g. To expand the functional-integral formula (2.2.2) in 
powers of g, we write 


PF S (2.8.2) 
where &, is the free Lagrangian: 
L, =(0A)?/2 — m?A?/2, (2.8.3) 


and & is the interaction Lagrangian: 
L, = —gA*/4!+(Z —1)(0A)?/2 — (mz — m?)A?/2 — (gg — g)A*/4!. 
(2.8.4) 
We will expand the renormalization counterterms, Z — 1, mj — m°, and 
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Jg — g,in powers of g,so that all of the terms in Y, have at least one power of 
g. The series expansion of the Green’s functions is then obtained from (2.2.2) 
as: 


Gil Xi Astia NN) 


y erin) [[44]40s,) : Avs] | d*yL oT exp (iSo[4]) 
__ hn=0 


= : (2.8.5) 
È in) | a| | aye | exp (iSo[A]) 


Here 


So[4] = |dty 2, = |d*y(0A2/2 — m?A?/2) 


is the free action. 

Each of the terms in the series is a Green’s function in the free-field theory 
(aside from a common normalization), so (2.8.5) is equivalent to the Gell- 
Mann-Low (1951) formula: | 


oe eee ee 
Y G"/n ( Ti | ay, )co Torli). prn) TI Av 10> 
y arin TI | dty,)<o TTT 410) 


n= j=l 


(2.8.6) 


Here ġp is a free quantum field of mass m. It is the field generated from the 
free Lagrangian Z =(0¢,)?/2 —m*¢?/2. Then Z is the quantum in- 
teraction Lagrangian, Z — £, which is a function of the free field p. 

To compute the integrals in (2.8.5) it suffices to compute the generating 
functional of free-field Green’s functions: 


[ta4exp(is.041 + fva) 
TAA Se ee (2.8.7) 
[d4]exp(iS,[A]) 


This is done by completing the square, i.e., by making the following change 
of variable: 


A(x) — A(x) + [ivaa — y)J(y). (2.8.8) 


Here, G(x) is the Feynman propagator satisfying 


(O + m2)G,(x) = — iô®(x), (2.8.9) 
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and a boundary condition that, after rotating to Euclidean space by 


x? = —it, G,(x) 0 as x > œ. Thus 


G = dk wes (2.8.10) 
p(x) — (2n)* k2 = m + P „0. 
The result is that 
Z[J] = exp l [atxatyatercets — nao}. (2.8.11) 


Green’s functions of free fields are obtained by differentiating with respect 
to J; for example 
OIT 0 a2 
= G,(x — y). (2.8.12) 


We can now derive the well-known Feynman rules for the interacting 
theory from (2.8.6). These can be given either in momentum or coordinate 
space. In either case the Green’s function Gy is written as a sum over all 
possible topologically distinct Feynman graphs. Each graph T consists of a 
number of vertices joined by lines. It has N ‘external vertices’, one for each 
o(x,), with one line attached, and some number, n, of interaction vertices. 
The interaction vertices are of several types, corresponding to the terms in 
the interaction Lagrangian (2.8.4). The vertex for the A* interaction has four 
lines attached and the vertices for the 0A? and A? interactions have two 
lines attached. The value of the graph, denoted I(T), is the integral over the 
position y, of the n interaction vertices. The integrand is a product of 
factors: 


(1) G,(w — z) for each line, where w and z are the positions of the vertices at 
its end. | 

(2) A combinatorial factor 1/S(T). 

(3) —ig, for each A* interaction. 

(4) —i(mĝ — m°) for each A? interaction. | 

(5) —i(Z — 1)0?/ðw"ðw, for each (3A)? interaction: the derivatives with 
respect to w act on one of the propagators attached to the vertex. 


For each Feynman graph a number of equal contributions arise in 
expanding (2.8.5). If IFT has no symmetries and if it has no counterterm 
vertices, then this number is n!(4!)" so that the explicit n ! in (2.8.5) and the 4! 
in each interaction are cancelled. Graphs with symmetries have a number of 
contributions smaller by a factor of the symmetry number S(T). (For 
example, the self-energy graph Fig. 2.8.1 has S= 6.) The combinatorial 
factor is then the inverse of S(T). 
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Fig. 2.8.1. A graph with symmetry fac- Fig. 2.8.2. A graph with a vacuum 
tor S=6. bubble. 


The denominator of (2.8.5) is the sum of all graphs with no external lines. 
The result is to cancel all graphs in the numerator that have disconnected 
vacuum bubbles (like Fig. 2.8.2.). 

In momentum space each line is assigned a (directed) momentum k. The 
Feynman rules are: 


(1) A factor i/[(22)*(k? — m? + ie)] for a line with momentum k. 

(2) A factor (2z)* times a momentum conservation 6-function for each 
vertex (external or interaction). 

(3) An integral over the momentum of every line. 

(4) A combinatorial factor 1/S(T). 

(5) — igp for each A* interaction. 

(6) —i(m2Z — m°) for each A? interaction. 

(7) i(Z — 1)p? for each (A)? interaction, where p is the momentum flowing 
on one of the propagators attached to the vertex. 


The perturbation series in (2.8.5) need not be convergent, but only 
asymptotic. Let Gy , be the sum up to order g” of the perturbation series for 
the Green’s function Gy. Then it is asymptotic to G, if for any n the error 
satisfies 


|Gy — Gyna | = O(g"**) (2.8.13) 
as g — 0. In general, perturbation theory is asymptotic but not convergent. 
This is rigorously known (Glimm & Jaffe (1981)) for the øt theory in the 


cases that the space-time dimension is d = 0, 1, 2, 3. (d = 0 is the case of the 
ordinary integral 


fox exp (— m?x?/2 — gx*/4!), 


while d=1 is the quantum mechanics of the anharmonic oscillator.) 
Physically, the reason for non-convergence is that when g < 0 the energy is 
unbounded below and so the vacuum-state continued from g>0 is 
unstable. (Dyson (1952) first observed this phenomenon in quantum 
electrodynamics.) 
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In later chapters we will assume (2.8.13). When we compute large- 
momentum behavior, it will be important to understand the maximum 
possible validity and accuracy of the calculations if the perturbation theory 
is asymptotic but not convergent. 


2.9 Spontaneously broken symmetry 


Consider the ¢* interaction. If m? is positive and g is small, we have a theory 
of particles of mass m slightly perturbed by the interaction. This interaction 
is basically a repulsive 6-function potential, as can be seen by examining the 
Hamiltonian in the non-relativistic approximation. There is a symmetry 
o> —®. 

But if m? is negative this interpretation is incorrect. The true situation can 
be discovered by noticing that the functional integral (in Euclidean space) is 
dominated by classical fields with the lowest Euclidean action, which is 


Seuva [4] = |d*x[ — (34)? /2 + m?A?/2 + gA*/4!]. (2.9.1) 


(Remember that (0A)? = —(0A/dt)? — VA? is negative.) If m?>0, then 
the minimum action field is A = 0. But, ifm? < 0, then there are two minima; 
these are constant fields with P(A) =0, i.e., A= A, =./(—6m?7/g) and 
A=A_=-—./(—6m?/g). 

We choose to impose the boundary condition A(x) A, aS x—> œ in the 
functional integral. (The condition A—A_ gives equivalent physics, 
because of the A > — A symmetry of the action.) Then field configurations 
with A close to A, will dominate. We may understand this by observing 
that field configurations with large regions where A isnotcloseto A, or A_ 
will give small contributions to the Euclidean functional integral (2.2.8) 
because their action Stua is so big. Indeed, a constant field with A not equal 
to A, or A_ has infinitely more action than one with A = A, or A_,and its 
contribution to the integral is zero. One’s first inclination then is that the 
only configurations that contribute have A> A, or A>A_ as x> ©. 
However, other configurations contribute, because there are many of 
them — one has to integrate over all possible fluctuations. However, one can 
argue — even rigorously (Glimm & Jaffe (1981) — that in general A will be 
close to A, orto A_. A typical configuration of the classical field A(x) will 
be close to one of these values over almost all of space-time. 

Given our choice of boundary condition A— A,, even more is true: a 
typical configuration is close to A = A, almost everywhere, rather than to 
either A_ or A,. The reason is that if it had a large region with A(x) close to 
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Fig. 2.9.1. Illustrating transitions between regions with fields close to different 
minima of the potential 


A_ (Fig. 2.9.1), then there would be a contribution to the action pro- 
portional to the size of the boundary between the regions of positive A(x) 
and of negative A(x). Only if the space-time dimension is d = 1 will we have 
a finite contribution from the boundary. This special case is quantum 
mechanics of a particle in a potential with two wells. The particle can tunnel 
between the two wells. 

In the case we have discussed, of a discrete rather than of a continuous 
symmetry, the argument that A is close to A, almost everywhere for the 
important configurations is correct in all space-time dimensions greater 
than one. The quantum field therefore has a vacuum expectation value close 
to A,: 


<0|6(x)|0> = W [fda] ataye aa ~A,. 


In the case of a continuous symmetry, there is a continuous series of 
minima of the potential. A field configuration can interpolate between 
different minima without going over a big hump in the potential. The only 
penalty comes from the gradient terms in the action. This suppresses 
configurations that do not stay close to one minimum, but only in more 
than two space-time dimensions. In one space-time dimension there is no 
spontaneous breaking of a continuous symmetry (Mermin & Wagner 
(1966), Hohenberg (1967), and Coleman (1973)). 

Perturbation theory can be considered as a saddle point expansion about 
the minimum of the action. We write A(x) = A'(x) + v where v = A,. Now 
we treat A(x) as the independent variable. We have 


EP =(0A')?/2 — M?A’?/2 — gvA3/3!—gA"4/4!+C. (2.9.2) 


Here C = — m?v?/2 — gv*/4!, and M? = gv?/2 + m? = — 2m? > 0. We now 
have a theory of particles of mass M with both an 4° and an A” 
interaction. The symmetry is hidden; its only obvious manifestation is in 
the relation between the A’* coupling and the mass and A” terms: 


gv = M(3g)'’?. (2.9.3) 
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We will show later that renormalization counterterms are correctly given 
by continuing in m? from positive m”. The vacuum expectation value of ¢ 
has corrections which can be computed in perturbation theory 


<0|¢|0> =v + ¢0|¢'|0> 
=v + O(g!), (2.9.4) 


Exactly similar methods can be applied if there is a continuous symmetry. 
Then the Goldstone theorem tells us that there will be a massless scalar 
particle for each broken generator. 


2.10 Fermions 


The field theories obtained by functional integration as in Section 2.2 
are all theories of bosons. This follows from the symmetry of the Green’s 
functions under exchange of fields (eg, <0|/T¢(x)d(y)|0> = 
<0|T¢(y)(x)|0>). In turn, this symmetry property follows from the 
functional-integral formula (2.2.2) because the integration variables (the 
values of the classical field A(x)) commute with each other. 

To get a theory with quantized fields, it is necessary to define something 
like an integral over anticommuting variables. A rather small number of 
properties of integration are needed to derive the equations of motion for 
Green’s functions. Requiring these properties determines the integration 
operation uniquely (Itzykson & Zuber (1980)). 

As an example, consider the following Lagrangian for a free Dirac field: 


L=Wid — Mw. (2.10.1) 


Here y is a four-dimensional column vector and wW a row vector, while 
@ = y"0,. The generating functional of Green’s functions is written as: 


Zini] =" | [dy dý] exp (i | L+ | nw + | in), (2.10.2) 


The fields and the sources n(x) and n(x) take their values in the fermionic 
sector of a Grassmann algebra. In the lattice approximation the definition 
of the integration in (2.10.2) is really algebraic (Itzykson & Zuber (1980)). 
Green’s functions are defined by differentiating with respect to the sources. 
One important difference between ordinary integration and Grassmann 
integration will be important in treating gauge theories. The simplest case 
of this difference is in the integral over two variables x and x of exp (ixax), 
where a is a real number. For ordinary real variables the integral is 


[ax dxe'*** =2n | dxô(xa) = 2r/a. (2.10.3) 
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For Grassmann variables we get 
| dxdxei*** = ia. (2.10.4) 


The overall normalization is irrelevant for our applications, for it is always 
cancelled by the overall normalization factor in the functional integral. 
What matters is that the a-dependence of (2.10.4) is inverse to that in 
(2.10.3). 


2.11 Gauge theories 


A gauge symmetry is an invariance under a group G where the group 
transformation is different at each space-time point. The earliest examples 
were General Relativity (where G is GL(4), the group of linear transfor- 
mations of the coordinate system), and electrodynamics (where G is the 
group of phase rotations). Yang & Mills (1954) and Shaw (1955) generalized 
the idea to a general group. Beg & Sirlin (1982) and Buras (1981) explain 
some of the uses of gauge theories as theories of physics. 
Let G be a simple group and let a matter field y transform as 


W(x) > exp ( — igw*(x)t,)W(x) = U(w(x))~ y(x). (2.11.1) 
The field is a column vector of components, and the hermitian matrices t, 
form a representation of the group, with structure constants c,,, defined by 


[tatg] = ic,p,t,. (2.11.2) 

The matrices U(w) form a representation of the group. 
In order that the action be gauge invariant, we need a .covariant 

derivative: 

Dw = (0, +19A,)w. (2.11.3) 
Here we have introduced the gauge potential A,. It is a vector under 
Lorentz transformation. As far as its gauge symmetry properties are 
concerned, it can be written as a matrix A, or in terms of components 4; : 


A, = } Aite (2.11.4) 
It transforms under the gauge soup as: 
A,,(x) > U(@(x)~ *[A, (x) — ig +0, JU(@(x)). (2.11.5) 
To build an action, we need the field-strength tensor 
Fy =0,A,—d,A, + ig[A,,A,], 
Fi, = 0,45 — 0,A% — gc,,,A4A?, (2.11.6) 


which transforms as F > U~!FU. 
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A gauge-invariant Lagrangian with spin 4 matter fields is 
L iny = —(Fiy)”/4 + YGP — MY 
= —trFF*/2 + Jp — Myy, (2.11.7) 
where we assumed the conventional normalization of the t,’s, viz., 
tr t,t, = 0,,/2. In an exactly similar way, an action can be set up using scalar 
fields. If there are matter fields in several irreducible representations a term 
for each is needed in X. The transformation (2.11.5) ensures that the 


coupling g is the same for all matter fields if the group is non-abelian. 
The form of the infinitesimal transformations is needed: 


Ô Y = — igw°t Y, 
ô ý =igwWt,, 


ÔA, = 0,0" + gC,,,0*°A”, 
bP iy = Glug, F”,. (2.11.8) 


If the group is not simple, then it is the product of several simple groups, 
e.g., SU(2)@®SU(2). For each there is a gauge field and an independent 
coupling. 


2.12 Quantizing gauge theories 


A gauge theory such as the one defined by the Lagrangian (2.11.7) can be 
solved by the functional integral. Thus, as an example, we can write for the 
fermion propagator. 


(0| Ty (x)P(y)|0> =N cr | [d4] [dydý]y(x)ý(y)exp (i | L 5 
(2.12.1) 


In fact, a lattice approximation to the functional integral forms the basis of 
Monte-Carlo calculations (Creutz (1983) and Creutz & Moriarty (1982)). 
The only trouble with (2.12.1)is that it is exactly zero. To see this we observe 
that, given any field configuration, we can make a gauge transformation on 
it, as in (2.11.1) and (2.11.5). The new field configuration has the same action 
as the old field. Thus the only dependence on the gauge transformation is in 
the explicit y/(x)(y). Now the gauge transformation is independent at each 
space-time point. So (2.12.1) contains a factor 


( I] fave )o- @U(y) 


points z 


= faumberi( fauco ')( favor»), (2.12.2) 
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which is zero. (Note that the propagator is a matrix in the representation 
space of the gauge group.) 

The vanishing of (2.12.1) is not a fundamental problem, for we may 
choose only to work with Green’s functions of gauge-invariant operators 
(e.g., Py, F%,F%", the Wilson loop (Wilson (1974) and Kogut (1983))). But 
the vanishing is a disaster for formulating perturbation theory; for among 
the basic objects needed to write the Feynman rules are the propagators for 
the elementary fields. An elegant solution to this problem was given by 
Faddeev & Popov (1967). The integral over all gauge fields is written as the 
product of the integral over fields satisfying some given gauge condition 
(such as 0- A* = 0) and of the integral over all gauge transformations. Any 
field configuration can be obtained by gauge transforming some con- 
figuration that satisfies the gauge condition. For a gauge-invariant Green’s 
function, the integral over gauge transformations amounts to an overall 
factor which cancels an inverse factor in the normalization. So the integral 
over gauge transformations can be consistently omitted. 

The new integral over fields with the gauge-fixing condition imposed also 
provides a solution to the theory. But the gauge-variant Green’s functions 
like (2.12.1) no longer vanish. It is necessary, moreover, to find the correct 
measure for the integral; this was the key point of the work of Faddeev and 
Popov. 

These authors also constructed a slightly different formulation; it is this 
formulation that is most often used, and that we will review now. A detailed 
treatment and further references are to be found in Itzykson & Zuber (1980). 
Here we will merely summarize the argument and derive the Ward 
identities in the form that we will use them. 

We will consider gauge conditions of the form F,[ A, x] =/,(x). There is 
one condition at each point of space-time and for each generator of the 
group. The functional F, might be d- A’, for example. The functions f,(x) are 
any real valued functions of x. 

Faddeev and Popov write an arbitrary Green’s function as 


<0|TX 


0) =H, [[AA LAV] LAT] X exp SAL] TTAF, ~ 1) 

ae (2.12.3) 
Here S;„ is the gauge invariant action, and X is any product of fields. The 
factor A[ A ] is a Jacobian that arises in transforming variables to the set of 
fields that satisfy the gauge condition plus the set of gauge transformations. 
The key result is that A[ A] is a determinant, so that it can be written as 


A= | [de, ][dc, ]exp(iY,.). (2.12.4) 
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Here c, and c, are anticommuting scalar fields, called the Faddeev—Popov 
ghosts. The so-called gauge-compensating Lagrangian is 


L p = 26,F,[A, x], (2.12.5) 


where 6,F, is the infinitesimal transformation of F, with w replaced by c. 
For the case F, = 0: A’ 
L go = ONC(O,Cy + GCypyCg A?) + divergence. (2.12.6) 


apy 


We treat c and ¢ as independent fields. They are not genuine physical fields, 
as they do not obey the usual spin-statistics theorem. 

A convenient form of solution to the theory is obtained by averaging over 
all f,’s, with weight exp (— €~'{f?/2). This leaves gauge-invariant Green’s 
functions unaltered, and gives the following formula: 


<0|TX|0> = s |ia fields] Xe's (2.12.7) 


with a different normalization. The integral is over all fields (A, y, W, c, ©). 
The action S contains three terms: 


S = [tx + L of + ae ie) (2.12.8) 


We have already defined the gauge-invariant Lagrangian by (2.11.7) and 
Lc by (2.12.5). The gauge-fixing term is 
L a = — iF} (28), (2.12.9) 
where € is an arbitrarily chosen parameter. (If desired, it may be absorbed 
into a redefinition of F.) 
The advantage of the form (2.12.8) is that Green’s functions of the 
elementary fields are defined as in a simple non-gauge theory. For a gauge- 


invariant observable X the equations (2.12.3) and (2.12.7) define the same 
objects as 


{O/T X|0> = WG | [dAdydý]X exp (iS,,,). (2.12.10) 


If X is gauge variant, then all the definitions give different results, and 
(2.12.7) depends on &. Quantities that depend on the choice of a gauge fixing 
are called gauge dependent, of course. We see that gauge invariance of the 
operators in a Green’s function implies gauge independence. 

It is important to distinguish the concepts of gauge invariance and gauge 
independence. Gauge invariance is a property of a classical quantity and 1s 
invariance under gauge transformations. Gauge independence is a property 
of a quantum quantity when quantization is done by fixing the gauge. It is 
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independence of the method of gauge fixing. Gauge invariance implies 
gauge independence, but only if the gauge fixing is done properly. 

Gauge theories such as (2.11.7) have a dimensionless coupling if space- 
time is four dimensional. General results, which we will treat in later 
chapters, imply that the theories need renormalization. However these 
same results imply that many more counterterms may be needed than are 
available by renormalizing (2.11.7). In Chapter 12 we will prove that the 
extra couplings are absent. The tools needed are the Ward identities for the 
gauge symmetry. These we will prove in the next section. It is also necessary 
to prove that the unphysical degrees of freedom represented, for example, 
by the ghost fields c, and c, do not enter unitarity relations. This proof also 
needs the gauge properties exhibited in the Ward identities (see Itzykson & 
Zuber (1980)). 


2.13 BRS invariance and Slavnov—Taylor identities 


After gauge fixing, the gauge invariance of a gauge theory is no longer 
manifest in the functional-integral solution. Slavnov (1972) and Taylor 
(1971) were the first to derive the generalized Ward identities that carry the 
consequences of gauge invariance. Their derivation was very much 
simplified by Becchi, Rouet & Stora (1975) through the discovery of what is 
now Called the BRS symmetry of the action (2.12.8). 

BRS symmetry is in fact a supersymmetry, that is, its transformations 
involve parameters that take their values in a Grassmann algebra. Let ôA be 
a fermionic Grassmann variable. Then the BRS transformation of a matter 
or a gauge field is defined to be a gauge transformation with w* = c*6A. Thus 

OprsW = — ig(c*dA)t,W = igt,” Yô, 
Sars = igt,c*5A, | 
Opps Ay = (0,,c% + gc,p,c?A?)OA. (2.13.1) 
Observe that 6/ is fermionic, so it anticommutes with fermion fields (c, ¢, Y, 
ý). The ghost fields transform as 


prs = —ZGICap, CCA, 
OprsC, = F,04/€. (2.13.2) 
(Note that c* and ¢c, are not related by hermitian conjugation, contrary to 


appearances.) Since £, is invariant under gauge transformations, it is 
BRS invariant. Hence 


Opes Z = Oprs(L gt T L gc) 
= — (1/E)F 5 e5,F gL 43x] — (1/0) F,546,F, — C5prs(5.F a) 
=Q. (2.13.3) 
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In the second line we used ¥,, = — ¢,0,F,, while to prove the last line zero 
we anticommuted 6/ and c in the first two terms of the second line. In 
addition we used the nilpotence of the BRS transformation: 


2 
(s) (y or ù or A% or c*) =0, 
A 


Opps z =0 (2.13.4) 
oA oo. te 


which follows from anticommutativity of the c’s. 
By applying the Noether theorem we find a conserved current: 
= gyt yer — F Dc — (1/8): A D" 
—5g(0%E,)cPC’Cyp,. (2.13.5) 
Although the BRS transformations involve Grassmann-valued para- 
meters, the derivation of Ward identities given in Section 2.7 goes through 
unchanged. For our purposes, we only need the integrated Ward identity 
(2.7.9). A case of (2.7.9) applied to BRS invariance is called a Slavnov— 
Taylor identity. A simple example is 
0 = dpps<0|TA2(x)eg(y)]0>/52 
= — (0|T(6,c, + 9C,5,€°A”)Eg(y)|0> 
+ (1/€)<0| T A*(x)d-A*(y)|0>. (2.13.6) 
We have defined the notation 6,,, (quantity)/dA to mean that the 6/ in the 
BRS variation is commuted or anticommuted to the right and then deleted. 
The most used cases of the Slavnov—Taylor identities are: 
0 = dprs<0| TXC,(x)|0>/d/ 
= — (0| T(Ogps X /dA)E,|0> + (1/)(0| TX0-A*|0>. (2.13.7) 
Here X is a product of fields with total ghost number zero. 
We will also need equations of motion. Let: 


,u 
J BRS 


4,-F- ‘wag = (iD — Myy, 

L, = —iB—M), 

Py = — DFO — gibytt,w + (1/02 A + geug (22), 

L = -— o D", 

£ a= — OE, — gcng (Ô E) A". (2.13.8) 


Then each of these is zero. Furthermore, for @ equal to any field in the 
theory we have: 


<0|T£4(x)X|0> = i<0| TSX /ôġ(x)|0>, (2.13.9) 
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where time derivatives in £, are taken outside the time-ordering (as usual — 
see Section 2.5). 


2.14 Feynman rules for gauge theories 


Feynman rules are given in Fig. 2.14.1 for the Lagrangian of (2.11.7), with 
the gauge-fixing term F, = 0- A’. Note that these agree with the figures but | 
not the equations of Marciano & Pagels (1978). They are the rules for 
quantum chromodynamics (QCD), the theory of strong interactions, if the 
gauge group is SU(3). The fermions are the quarks and consist of several 
triplets of SU(3), each with its own mass term. (In the conventional 
terminology, the gauge field is called the gluon field and the gauge 
symmetry is called the color symmetry of strong interactions. Each 
irreducible representation in the quark field is called a flavor, and has a 
label: u, d, s, c, b, etc.) 

The same Lagrangian also describes quantum electrodynamics (QED) if 
we change the gauge group to U(1). In that case there is but one gauge field 
(the photon) and, since the group is abelian, the three- and four-point self- 
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Fig. 2.14.1. Feynman rules for the gauge theory defined by the Lagrangian (2.11.7). 
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interactions of the gauge field vanish. Moreover, with the gauge fixing term 
F[ A] = 0-A there is no coupling to the ghost fields, so we may drop them 
from consideration. The transformations on the matter fields are simple 
phase rotations: ye ‘4° where q is the charge of the field (negative for 
the electron field). The transformation of the gauge field is A, > A, + 0,0. 


2.15 Other symmetries of (2.11.7) 


The Lagrangian (2.11.7) is gauge invariant. After gauge fixing we get 
(2.12.8), which is not gauge invariant, but which has BRS symmetry. The 
action (2.12.8) also is invariant under global gauge transformations — those 
with constant œ — because we chose the gauge fixing not to break this 
symmetry. 

There are also what in strong interactions are called flavor symmetries. 
These are transformations that act identically on every member of an 
irreducible representation of the gauge group. In this case they give 
conservation of the number of each of the different flavors of quark. Other 
flavor symmetries include the discrete symmetries of parity and time- 
reversal invariance. 

Charge-conjugation is also an invariance of (2.12.8), and its action on the 
ghost fields is rather interesting. Let us define ¢, by the parity of the 
representation matrices under transposition: 


t = Eta (X). (2.15.1) 
In this and the following equations, the symbol A) means that the 


summation convention on repeated indices is suspended. The fermion and 
gauge fields transform as usual: 


A! — — Ate, (Ñ), 


Y => nli? y Y)", (2.15.2) 
where y, is a real matrix such that 
th = ni bN (2.15.3) 


The ghosts transform as: 
C7 — &%c*, CF — A (2.15.4) 
Consider the c* — č’ — A? and the Aj — A“ — A? Green’s functions. They 
are invariant under global gauge transformations so only two couplings of 
the gauge indices are possible: c,,, which is antisymmetric, and a symmetric 


coupling which we can call d,,,. Charge-conjugation invariance prohibits 
the symmetric coupling. 
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2.16 Model field theories 


Although the concepts of quantum field theory are very general, we have 
reviewed them by examining mainly two specific models. The first was the 
theory of a real scalar field (2.1.1), mostly with the øt interaction (2.2.1). The 
second was a gauge theory (2.11.7) with matter described by some Dirac 
fields. It should be clear that the general principles apply to any Lagrangian 
L.A field theory is specified by listing its elementary fields and giving a 
formula for Z. It is solved by a functional-integral representation of its 
Green’s functions. 

The aim of physics is to describe the real world. To the extent that a field- 
theoretic description is the correct one, the fundamental problem in physics 
is to find the correct field theory. In fact the Lagrangian (2.11.7) appears to 
do this for strong and electromagnetic interactions if the gauge group and 
matter fields are correctly chosen. Weak interactions can be included by the 
Weinberg—Salam theory, and many speculations have been made about 
extensions (see the proceedings of most recent conferences on high-energy 
physics). 

Our aim in this book would be badly served by only treating real 
theories. One reason is that we wish to develop techniques and concepts 
applicable to any field theory, for example not only to the many Grand 
Unified Theories currently under discussion (see Langacker (1981) and 
Ross (1981) for reviews), but also to the theories to be invented in the future. 
As is usual in the subject, we will make use of field theories that are more 
properly called models. The f theory is an obvious case. Another 
important reason for using models is to be able to discuss particular aspects 
of the methods without having other complications to clutter up the 
presentation. | 

Particular models will be introduced as needed. Some will recur often, 
such as the ¢* theory and the simple gauge theory (2.11.7). 

Another frequently used model is the ø? interaction of a real scalar field: 

L =(6o)?/2 — m o7/2—g¢?/3!. (2.16.1) 
This is much more unphysical than the other models. It is not even 
completely consistent. Because of the @° interaction, the energy is not 
bounded below. This is manifest in the classical theory and true in the 
quantum theory (Baym (1960)). Hence any state must catastrophically 
decay. But the perturbation theory is well-defined, and somewhat simpler 
than for the ¢* theory. So it proves very convenient to use the ° model in 
treating the elements of the theory of renormalization within perturbation 
theory. 
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Another way of constructing models is to change the dimension, d, of 
space-time from its physical value 4. One motivation for this is that the 
renormalization problem becomes easier as d is reduced; the degree of 
divergence of a Feynman graph decreases. As we will see in Chapter 4, it is 
both useful and possible to treat d as a continuous variable, for the purpose 
of computing the values of terms in the perturbation expansion. 


3 


Basic examples 


In the later chapters we will develop methods to treat large-momentum 
behavior. The complete treatment becomes rather intricate at times, so this 
chapter is devoted to exposing in their simplest form the issues we will be 
discussing. We will do this by examining the self-energy graph in o° theory. 
This will exhibit the basic phenomena which we will later be treating in 
detail. 

We will see that (in four-dimensional space-time) the graph is re- 
normalized by a mass counterterm. Then the concept of ‘degree of 
divergence’ will be introduced by varying d, the dimensionality of space- 
time. This device will enable us to see how simple power-counting methods 
determine what counterterms are needed. It will also introduce us to the 
method of dimensional regularization. 

The renormalization group will be introduced by examining the behavior 
of the graph as its external momentum, p, is made large. By exploiting the 
arbitrariness in the renormalization procedure, we can reduce the size of 
higher-order contributions when p” is large. 


3.1 One-loop self-energy in ¢° theory 


Consider the graph shown in Fig. 3.1.1 in the œ? theory of (2.16.1). We 
define its contribution to the self-energy to be i times the value of the graph 
with the external propagators removed: 


2 i g’ 4 1 
ET f emae a 


The overall factor 4 is a symmetry factor. 


k 


p 
Fig. 3.1.1. One-loop self-energy graph in ¢° theory. 


38 


3.1 One-loop self-energy in œ? theory 39 


When all components of k” get large, this integral diverges logarithmi- 
cally. It is the simplest example of an ultra-violet divergence. As we will see, 
the divergence can be cancelled by a mass counterterm. But to explain the 
renormalization properly, we must discuss a number of other issues as well: 


(1) The fact that if |k°| ~ |k| the divergence as k goes to infinity appears to 
be much worse. 

(2) A precise way of formulating the statement that the divergence is 
cancelled by a mass counterterm. 

(3) The arbitrariness inherent in the renormalization. 

(4) The interpretation in coordinate space. 


3.1.1 Wick rotation 


The first of these problems is handled by recalling the Wick rotation into 
Euclidean space that was used to define the functional integral. This 
rotation determined the sign of the ie in the free propagator. The Wick 
rotation involved starting with imaginary time t = — it, then performing 
the integral, and finally analytically continuing back to real time. In 
momentum space, this forces us to work with k? = + iw, the opposite sign 
appearing so that in the Fourier transformation e’* is always a phase. 

In the Euclidean formulation let us perform the k°-integral first. The pole 
structure in the k°-plane is shown in Fig. 3.1.2, when p° is imaginary. In this 
situation k? and (p +k)? are both negative, so that the integrand is positive 
definite. Observe that the factor i coming from the Wick rotation combines 
with the overall factor of i in (3.1.1) to make £, real. (We have d*k = 
idwd °k.) 

Now rotate p° back to a real value. If | p°|<m then we have the situation 
shown in Fig. 3.1.3: there is no obstruction to rotating the k? contour to run 
along the real axis. It is only at this last step that there is a problem from the 


A 


x X 


Fig. 3.1.2. The k°-plane when p° is Fig. 3.1.3. The k°-plane when p° is real, 
imaginary. but |p°| is less than m. 
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Fig. 3.1.4. The k°-plane when p° is real, but |p°| is greater than m. 


region of |k°|~|k|. To avoid the problem we merely have to define the 
integral by rotating the k°-contour to run along the imaginary axis. 

Now continue p° to the region |p°| >m. The case of negative p° is 
illustrated in Fig. 3.1.4. Again we Wick-rotate to imaginary k°, but this time 
we pick up a pole term. Now the pole term occurs only when (p°)? > m? + 
(+k? = E?,,. Thus it contributes only in a finite region of k; the UV 
divergence still comes from the integration over imaginary k°. 

The moral of all this is that the UV divergence is essentially Euclidean, 
i.e., we may regard k? as imaginary and k? <0, (p+ k)? <0. 


3.1.2 Lattice 


We next need to quantify the divergence. The divergence comes from the 
asymptotic large-k behavior of the integrand which is 1/(k*)”. Let us add 
and subtract a term with this behavior: 


x2 1 l 
ye dtk bl a ml a Al aa dA aaa e A aa 
327 (k* — m° +ie)[(p +k)? —m*+ie] (k?— p? + ie) 


1 
+ aen (3.1.2) 


The first integral is manifestly finite, for we have subtracted off the leading 
asymptotic behavior of the integrand. To avoid introducing an extra 
divergence at k? = 0 we have subtracted 1/(k* — u°}? rather than 1/(k?°}?. 
Since we add this term back on, the value of y is irrelevant; £, is unchanged. 
The second term, while divergent, is independent of p. This is the fact that 
will enable us to cancel the divergence by a counterterm. 

Of course we are manipulating divergent integrals, so that (3.1.2), as it 
stands, makes no sense. We will remedy this defect by using the fact that the 
theory is defined initially on a lattice. The propagators will then be different 
functions of momentum. However, the structure of (3.1.2) will be unchanged 
after imposing a cut-off, as we will now show. 


3.1 One-loop self-energy in ¢° theory 4] 


To define the functional integral we not only had to Wick rotate time, but 
also had to put the theory on a space-time lattice, of spacing a. In the lattice 
theory, let the free propagator of a particle of mass M be S,(q; M,a). In the 
limit ga— 0, this is just i/(q? — M? + ie). But it is zero if ga > 1, since high- 
momentum states do not exist on the lattice. (The reason is, of course, that 
when one makes a Fourier transformation on a discrete space, one only uses 
momentum modes with wave-lengths longer than a lattice spacing.) The 
self-energy on the lattice is finite, and (3.1.2) now reads: 

ig? 
32n* 


x, (p, mM, a) = f fatedsec m, a)Sp(p + k; m, a) — Sp(k H, a) | 


F [atks tin a} 


= È ifin(P, M, p,a) + 2 aivlM, u,a). (3.1.3) 


All the integrals are now convergent, so (3.1.3) is a correct version of (3.1.2). 
As the lattice spacing goes to zero, the first integral approaches the first 
convergent integral in (3.1.2). The second integral diverges, but is inde- 
pendent of p. Thus (3.1.2) is not nonsensical, provided that the propagators 
are implicitly replaced by lattice propagators wherever necessary. 


3.1.3 Interpretation of divergence 


No matter how it is manipulated, the self-energy diverges in the continuum 
limit. The use of a lattice cut-off now enables us to quantify the divergence. 
From (3.1.3) 


ig? 
25 = a [ates shin a)? + È ifin 


2 
—g 1 Se 
NEAR dod k- + finite 
32n* i +k? < 1/a2 (w? + k? + yu’)? 


—g? 1/a k? 
= —— dk——=——— + finit 
1672 | T inite 
= Te” l/a + finite as a-0. (3.1.4) 


Thus wecan interpret the divergence as follows: Let È be (i times) the sum of 
self-energy graphs. (As usual, the self-energy graphs are graphs for the 
propagator that have the external lines amputated and that cannot be split 
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Fig. 3.1.5. Summation of self-energy graphs into propagator. 


into disconnected parts by cutting a single line.) Then the full propagator is 
G,(p”) =i/(p? — m? — È + ie). (3.1.5) 


This equation is illustrated in Fig. 3.1.5; the propagator is the sum of a 
geometric series involving the self-energy. The actual mass m,, of the 
particle is determined by the pole position, p? = m;,,. Evidently mŝ, is not 
m? but m? + X(p* = m2, m°). In other words, the self-energy represents 
the dynamical contribution to the mass coming from the interactions. The 
divergence (3.1.4) is independent of p?, so it is precisely a contribution to the 
mass. (We ignore higher orders for now.) 

Traditionally, one observes that it is convenient to parametrize the 
theory, not by the mass parameter m that cannot be observed directly but 
by the physical mass m,,,. One writes the mass term in 2 as 

— moo? /2 = — m2, o7/2 — dm* p? /2. (3.1.6) 
The first term is left in the free Lagrangian, so that the free propagator is 
i/(p? — Mon + ie). But the second term — called the mass counterterm — is 
put into the interaction Lagrangian, and adjusted so that the full 
propagator has a pole at p* = Mîn. The counterterm exactly cancels the 
dynamical contribution to the particle’s mass. This is the basic idea of 
renormalization. It is physically irrelevant that 6m? happens to diverge. 

In perturbation theory, ôm? is determined as a power series in g. To O(g) 
we have, in addition to Fig. 3.1.1, the graph of Fig. 3.1.6 corresponding to 
the mass counterterm in (3.1.6). The self-energy to O(g?) with the new 
parametrization is the sum of Figs. 3.1.1 and 3.1.6. We call it the 
renormalized self-energy: 


Lin = {2 (P; mina) + 6m’ } lao 
= Lipin(P? 5 Men ; y, 0) + [dm* + Z aiv]. (3.1.7) 


—— 


Fig. 3.1.6. Counterterm graph to the self-energy. 


We adjust ôm? first of all to cancel the divergence in È 4;,, so that the term in 
square brackets is finite as a0. Then we adjust the finite part so that 


Ôm? + Eiai = — È ifin (Mfrs Mon» H4). (3.1.8) 
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For the renormalized self-energy we find 


ig? fa 1 1 
Zir = z773 [dk 2 243 2 eee e 
327 (k* — m3, + ie)[(p +k) — m3, +ie]  (k? — p? + ie)? 
— value at p? = må.. (3.1.9) 


Note that the u-dependence cancels. 
Equation (3.1.9) gives the value of the self-energy in the continuum theory 
correct to O(g7). 


3.1.4 Computation 


One way to calculate Èg is to differentiate with respect to p°. (It is of 

course a Lorentz scalar.) Integrating the result gives 2 ,,; the constant of 

integration is fixed by the renormalization condition that X,,(m>,) is zero. 
We have: 


ôE P O y 
Op? 2p? Ap" 1R 
EE. 
ap | eam Fig fip+ ky mvs OOO” 


This is identical to what we would have obtained from the unrenormalized 
expression (3.1.1) without regard to the fact that it is divergent. We could 
have written it down directly without going through the long explanation 
that we used. But then there would have been no defense to the argument 
that we are manipulating meaningless quantities and that therefore 
quantum field theory makes no sense. 

Since (3.1.10) is finite it can be easily calculated by using a Feynman 
parameter representation and then by shifting the k-integral: 


OZR = ig? | xx [ats p(p +k) 
0 [mon 


ôp? 16n*p? — p*x — 2p-kx — k? — ie]? 
ig? f p[k + p(l — x)] 
d a : 
p | xx |a i mo, — p°x(1 — x) —k? — ie]? 


sg aA te x(1 — x) 
7 | ara 5 p°x(1 T x)] 


1 


atl dx In [m2, — p?x(1 -»]}} (3.1.11) 
nN o 
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Using the condition Z,, =0 at p? = m5, now gives 
2 mi X enn 
Zir =— | dxin we | (3.1.12) 
32n“ Jo msa (l — x+ x*) 


This integral can be worked out analytically. 


3.2 Higher order 


The graph of Fig. 3.1.1 is not the only divergent graph in the theory. In 
Chapter 5 we will discuss the general theory of renormalization and we will 
see how to extend the removal of divergences to all orders. In this section we 
will only consider a class of graphs which have divergences generated 
because Fig. 3.1.1 occurs as a subgraph. Examples are given in Fig. 3.2.1. 

One property should be clear. This is that the divergences come from 
subgraphs all of whose lines are part of a loop. A general way of 
characterizing these subgraphs is to define the concept of a one-particle- 
irreducible graph or subgraph. A one-particle-irreducible (1 PI) graph is one 
which is connected and cannot be made disconnected by cutting a single 
line. A graph which is not 1PI is called one-particle-reducible (1PR). The 
graphs in Fig. 3.2.1 are all one-particle-reducible, since they all have one or 
more lines that when cut leave the graph in two disconnected pieces. The 
self-energy subgraph of Fig. 3.1.1 consisting of the two lines in the loop is 
1PI. This identical subgraph occurs several times in the graphs of Fig. 3.2.1. 

We introduced a mass counterterm into the interaction, so that the 
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Fig. 3.2.1. Graphs containing the one-loop self-energy as a subgraph. 
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(a) (b) 


(c) 
Fig. 3.2.2. Counterterm graphs to Fig. 3.2.1. 


counterterm graph Fig. 3.1.6 cancels the divergence in Fig. 3.1.1. Clearly 
the counterterm vertex can be used as an interaction anywhere in a graph. 
In fact, all graphs containing it can be found as follows: (a) take a graph with 
the loop of Fig. 3.1.1 occurring as subgraph one or more times, but with no 
mass counterterm vertices; (b) replace one (or more) of the occurrences of 
the loop by the counterterm. The terms generated from Figs. 3.2.1 (a)—(c) 
are shown in Fig. 3.2.2. 

Evidently, the sum of the original graph and its counterterm graph(s) 1s 
just the original graph with every occurrence of the loop replaced by its 
renormalized value — iÈ g. It is sensible to keep the counterterm associated 
with the loop, thereby considering the loop plus the counterterm as a single 
entity. 

In the case of the graphs of Fig. 3.2.1 the result of this procedure is to 
make the graphs finite. The generalization to an arbitrary graph will be 
worked out in Chapter 5. 


3.3 Degree of divergence 


We saw how the Wick rotation ensured that the UV divergence of the one- 
loop self-energy is a purely Euclidean problem. The divergence is from the 
region |k“|— co, without any regard to direction. Thus, simple power- 
counting determined that there is a logarithmic divergence. The power- 
counting involves merely counting the powers of k in the integral for large k. 
The divergence is logarithmic as the lattice spacing, a, goes to zero. 
Power-counting in this fashion works for a general graph to determine 
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what is called either the ‘overall degree of divergence’ or the ‘superficial 
degree of divergence’. This we will discover in Chapter 5. There we will also 
see how the value of the degree of divergence determines the particular 
counterterm vertices needed for a theory. This will enable us to determine 
whether or not a theory is renormalizable by invoking arguments revolving 
around the dimension of the coupling. 

In this section we will vary the dimensionality, d, of space-time in the 
calculation of the self-energy graph, Fig. 3.1.1. We can then explore the 
relation between the degree of divergence, as determined by power- 
counting, and the momentum dependence of the counterterm. 

The integral for Fig. 3.1.1 is now 

g ig? 1 
2, (p d= 9 (ho aro Sa Ey (3.3.1) 
The space-time has one time dimension and d — 1 space dimensions. In the 
Feynman rules the factors (27)* get replaced by (27)*, since they arise from 
the result 


[atvet = (2n)46(k), (3.3.2) 


The number of powers of k in the integral is now d — 4; we call this the 
degree of divergence of the graph. If d is less than four, then the graph is 
convergent. But whenever d is greater than or equal to four, the graph 
diverges. Our discussion in the previous sections tells us we must try to 
renormalize it by adding a counterterm. 

Now, differentiating once with respect to p” gives convergence if d = 4: 


0x, - ig? fe (p+ k), (3.3.3) 


ap" Gay J Elot E 

with the degree of divergence of the integral being reduced by one to d — 5. 
If d = 5 the integral diverges logarithmically. However one might surmise 
that the divergence comes from the piece of the integrand proportional to k, 
and that symmetrizing by k —> — k would kill the divergence. This is in fact 
true, but let us be more simple-minded. 

Differentiating again with respect to p gives a result with degree of 
divergence d — 6: 


3'E, _ ig? gap t2lP + k) (p + k), — gal + k)? — m* J} (834) 
Op"dp’ (2n) (k? — m?)[(p +k)? — m7]? 

To recover Łęg at d=5 we integrate twice. There are more constants of 

integration that appear as an additive contribution of the form: A + Bp". 
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However, we must require Lorentz invariance of 2, so the B, term is 
eliminated, and we are left with a mass term as the only counterterm. This is 
the first and simplest example of the use of a symmetry argument to 
determine the form that we will allow for a counterterm. 


3.3.1 di atd=6 


Let us now go to d = 6. Differentiating X, three times gives a finite result 
with degree of divergence d — 7. Integrating to obtain 2, gives arbitrary 
integration constants of the form A — Bp*, where again we have used 
Lorentz invariance. If we went to the lattice we would find divergences 
proportional to 1/a”, m? In (a) and p°ln (a). The fact that these terms have 
dimension 2 corresponds to the fact that the integral for 2, has degree of 
divergence 2. To make it finite we must not only use a mass counterterm but 
also a counterterm proportional to p°; the total we will call ôm? — ôZ p°. 
This is generated by a counterterm 


— m? b?/2 + ôZ(ô¢)?/2 (3.3.5) 


in the Lagrangian. 

Evidently the value of the degree of divergence is reflected as the 
maximum number of derivatives or powers of p in the counterterms. 
Equally, it is reflected in the integration constants that appear when we 
recover È g from the differentiated £,. These two phenomena happen for a 
general graph, as we will see later. The method of proof will in fact be to 
differentiate each graph enough times with respect to its external momenta 
until it is finite. 

The (ô¢ġ)? counterterm in (3.3.5) is of course an example of the wave- 
function renormalization introduced in Section 2.3. We can interpret it 
physically by examining the propagator. The propagator for the bare field 
can be expressed in terms of the propagator of the renormalized field : 


G2) = <0| T $o (P)o(0)|0> 
=Z<0|Tġ(p)ġ(0)|0> 
=iZ/(p* — ms, — Eio) — ÔM? + dZp? + ie) 
=iZ/(p? — m3, — Zip). (3.3.6) 
Note the distinction between bare and renormalized fields. The residue of 
the particle pole in the propagator of an interacting field is in general not 
unity: 
Goo) = iRio)/(p? — m3, + ie) + finite, as p?—>m%,. (3.3.7) 


Examination of its spectral representation demonstrates that 0 < R(0) < 1, 
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because the field ¢, has canonical equal-time commutation relations. (The 
proof is given, for example, in Section 16.4 of Bjorken & Drell (1966).) 

Now we can always change the definition of ¢ = Z~"? ġo by multiplying 

Z by a finite factor. So it is possible to adjust Z so that the renormalized 

propagator has a pole of unit residue at p? = m?,: | 

G, =i/(p* — mą + ie) + finite, as p?—>m),. (3.3.8) 

In this case we identify Ro) with Z. The renormalized self-energy satisfies 

Lig = OL /ôpþ?=0 at p =m}. (3.3.9) 

When integrating to obtain Łęg from the finite derivative of X,, this 


condition enables the integration constants to be determined. It is called the 
mass-shell renormalization condition. 


3.3.2 Why may Z be zero and yet contain divergences? 


As a property of the exact theory we know that 0 < Z < 1, if we adopt the 
renormalization condition (3.3.9); Z is definitely finite. However its 
perturbation expansion starts at 1 + g*[C ln (a) + finite] + ---. (Here C is a 
constant.) This seems to be infinite as a— 0, rather than finite. We resolve 
_ the contradiction by realizing that we should not expect higher-order terms 
to be small if the one-loop correction is large. For example, we could have 


the series 
1 D 
a f — g°[ (C/D) 1n (a) + i ? Cane 


where D is a positive number. Then Z —>0 as a —> 0 even through the one- 
loop term goes to infinity. In any event we see that C must be positive 
(otherwise if we fix a and let g >O there is a region with Z > 1). 

It would appear impossible to derive a formula like (3.3.10) since it 
involves summing all orders of perturbation theory. Moreover it involves 
an analytic continuation from within its radius of convergence 
|In(a)| < D/g?C to |In(a)| = œ. This seems to make no sense at all since 
perturbation series are in general asymptotic series rather than convergent 
series. However, we will see in Chapter 7, on the renormalization group, 
that we can find a systematic method of calculating Z and the other 
renormalizations in the limit a—0. It relies on the so-called asymptotic 
freedom of the theory. The behavior (3.3.10) will turn out to be essentially 
correct, in asymptotically free theories. 

No matter what the truth is, it should be clear that the divergences in 
perturbation theory as a—0 need not be reflected as divergences in the 
exact theory, but only as singularities. 
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3.3.3 Renormalizability and non-renormalizability 


Suppose we now goto a dimension d > 6. For example, let us set d = 8. Then 
the one-loop self-energy has a quartic divergence. The necessary counter- 
terms contain up to four derivatives of the field: 


— 5m? 2/2 — 5Z(6)?/2 + E(Ob)?/2. (3.3.11) 


The quartic term is not of the form of any term in the original Lagrangian, 
so it cannot be obtained by renormalization of the Lagrangian. When this 
situation occurs the theory is called non-renormalizable. Non- 
renormalizability is a priori a good reason for dropping the theory from 
consideration. There are possible ways to avoid this, but we will leave 
discussion of this until later. 

There is a simple argument that helps in the determination of whether or 
not a theory is renormalizable. It links dimensional analysis and power 
counting. Consider a one-particle-irreducible graph I. Let its degree of 
divergence be ô(T`), and let its mass dimension be d(T). Now I is the product 
of a numerical factor, a set of couplings, and an integral. As we see from 
(3.3.1) the degree of divergence is the dimension of the integral. Therefore, if 
we let A(T) be the dimension of the couplings in T, we have 


d(T) = d(T) + A(T). (3.3.12) 
Now consider the counterterms to I’. These form a polynomial of degree 
6(I-) in the external momenta. For each term C in the polynomial, let 6(C) be 


the number of derivatives and let A(C) be the dimension of the coefficient, so 
that its dimension is the same as T: 


| 6(C) + A(C) = d(T). (3.3.13) 
Now, the maximum number of derivatives in the counterterms 1s 
ôl) = d(T) — A(T). 
If the couplings have negative dimension, then 6(I-) can be made arbitrarily 
large by going to a graph of high enough order. In the absence of 
miraculous cancellations this tells us to expect non-renormalizability. If the 
couplings have zero or positive dimension, we have a finite number of 
counterterm vertices, since — A(T) is bounded above by zero and d(T) 
decreases as the number of external lines increases. 
If the couplings never have negative dimension, we observe that the 
coefficients of the counterterms satisfy 
A(C) = d(T) — 6(C) 
> d(T) — d(T) 
= A(T), 
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so that these coefficients also have non-negative dimension. This, in the 
simplest cases, is sufficient to ensure renormalizability. 

For example, in the ¢° theory ¢ has dimension d/2 — 1. (Recall that Z 
has the dimension of an energy-density.) Then A(g) = 3 — d/2. So if d > 6 the 
theory is non-renormalizable, as we saw by example. If d < 6, there are only 
a finite number of possible counterterms, since these are restricted to have 
coefficients of non-negative dimension. 


3.4 Renormalization group 


3.4.1 Arbitrariness in a renormalized graph 


The infinities of a renormalizable theory amount to divergent dynamical 
contributions that renormalize the parameters in the Lagrangian. 
Traditionally one thinks of renormalization as the procedure of working 
with measured quantities instead of the corresponding bare quantities. The 
most obvious case is that of the mass m,,, of the particle corresponding to an 
elementary field. However to take the traditional view is much too 
restrictive. 

This issue can be understood by looking at strong interactions. There we 
have a theory, QCD, in which free particles corresponding to the 
elementary fields do not appear to exist (pace LaRue, Phillips & Fairbank 
(1981)). So arises the hypothesis of quark confinement — not proved from 
QCD, so far — according to which quarks are never isolated particles. Even 
so, the theory has quark masses, which can be measured (up to considerable 
uncertainties for the light quarks). But one cannot identify these masses 
with the directly measurable masses of free quarks. One must only speak of 
mass parameters, measured, in this case, rather indirectly. 

There is no problem in taking this point of view. For example, we write 
the @* Lagrangian as a basic Lagrangian plus a counterterm Lagrangian: 
L = 07/2 — m? o?/2 — go?/3! — Z 07/2 — 8m? 7/2 — 6gg?/3!. 

(3.4.1) 
But we avoid identifying the renormalized mass with the mass m,, of a 
particle. Similarly we do not identify the renormalized coupling, g, with any 
specific measured quantity, and we do not define Z by requiring that the 
residue of the propagator’s pole be unity. 

Consider the calculation of the one-loop self-energy at d= 4. We can 
choose 6Z = Ofor this case; the only divergence is in dm’. The renormalized 
self-energy (3.1.7) is 


Lin = Litin + (6m? + Eiai). 
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We must choose ôm? to have a divergent part to cancel the divergence at 
a= Qof È y. But the finite part of ôm? is not determined; the arbitrariness 
is the same as that of the integration constant when obtaining L,, by 
integrating 0L,/¢ép". 

At first sight it might appear that the arbitrariness ruins the theory unless 
one pins down m to be the physical mass m,,,. This is in fact not so; the 
arbitrariness is more like the arbitrariness in choosing a coordinate system. 
Suppose one first computes the propagator with the mass-shell condition 
Lir(p* = m2.) = 0. Then 


G, = i/[p? = Mon = Dip) 25 È itin P? = Mon) T O(g*)], (3.4.2a) 


m = Mon aeo T Lt tin(M>n) + O(g*). (3.4.2) 

One could also compute with a different finite part to m7, with a result 
G, =i/[p? — m* — Xygiq(p?,m7) + g?C + O(g*)], (3.4.34) 

me =m? — E aiy —97C + O(g?) (3.4.3b) 


where C is any chosen number. The self-energy is now 

wel’, m?) = icin + gC. 
Evidently the two ways of renormalizing the theory give the same results if 
we require that the bare mass m@ is the same in both of (3.4.2) and (3.4.3). In 
the complete solution of the theory, say by the functional integral, it is only 


m< that matters, not the partition into a renormalized mass squared m? and 
a counterterm — Z,,,,—g7C. Clearly we have 


m? = ms, + 97°C — Xypin(mn) + OG"), (3.4.4) 


with the O(g*) terms depending on the renormalization of higher-order self- 
energy graphs. 

We come then to the central idea of the renormalization group. The 
arbitrariness in the definition of Èp is physically irrelevant, for a change in 
the arbitrary constant C can be exactly compensated by a change in m”. A 
change in C merely gives a different parametrization of the set of theories 
that can be obtained by varying the mass parameter m. The renormali- 
zation group is the set of transformations on the parametrizations of the 
theory. The transformations are accomplished by moving parts of the terms 
in Y from the basic Lagrangian to the counterterm Lagrangian. In the case 
of m it is a move from the free Lagrangian to the interaction Lagrangian. 
This of course gives a rearrangement of the perturbation series, which is the 
key to the many practical applications of the renormalization group. 

It might be objected that 


p? — m° — Xigin(p?,m?) + g°C 
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is not equal to 
pm Dial mon) + Eirin (Min Men)s 

since the mass parameters in ©,,,,(p) are different, whereas the theory 
parametrized in either way is the same. But since m3, — m? is O(g’) the 
difference in the two expressions is in fact O(g*). Thus the rearrangement of 
the perturbation series does not leave the p?-dependence of the coefficients 
invariant. The O(g*) terms will cancel the difference (up to even higher- 
order terms), etc. 

The utility of the renormalization group is precisely in its ability to 
reorganize the perturbation series. Since one effect of the interaction is to 
induce dynamical contributions to the mass and couplings, it is evidently a 
good idea to arrange that these contributions are small. The result is to 
reduce the values of higher-order corrections and thus improve the 
reliability of a perturbative calculation. 

Now the effective size of the dynamical mass or coupling must be treated 
as dependent on the situation under consideration. This can be seen by 
examining Èg given in (3.1.12) at large p°: 

Liz ~ (97/3227) [In( — p*/m3,) + constant + +]. (3.4.5) 
If |p? | is large enough this can be large. Since the graph occurs as a subgraph 
of higher-order graphs, it is likely (and often is true) that higher-order 
graphs are as important as low-order graphs at large enough p’. This 
situation is undesirable and can be remedied by a renormalization-group 
transformation. 

We absorb the large part of £,, into a redefinition of the renormalized 
mass m?. We must examine higher-order graphs at large p* to demonstrate 
that there are no further sources of large coefficients. We will do this 
systematically in Chapter 7. 


3.4.2 Renormalization prescriptions 


There are infinitely many ways of resolving the ambiguity in constructing 
the counterterms for a given theory, each of these ways corresponding to a 
particular parametrization. It is essential that, whenever a particular 
divergent graph occurs as a subgraph of a bigger graph, the ambiguity is 
resolved in the same way at each occurrence, since the corresponding 
counterterm vertex is generated by a single term in the Lagrangian. So to 
perform concrete calculations one adopts some rule to resolve the 
ambiguity. Such a rule is called a renormalization prescription or 
renormalization scheme. 
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Of the infinitely many possible renormalization prescriptions, a few have 
become standard, because they are especially convenient either for practical 
use or for theoretical considerations. In this section we will explain two of 
the standard ones with the aid of the example of the one-loop graph 
Fig. 3.1.1. 

We have already encountered the mass-shell, or physical, scheme. The 
renormalized mass is defined to be the physical mass, i.e., the position of the 
propagator pole. Wave-function renormalization is fixed by requiring the 
residue of the pole to be unity (see (3.3.9)). Couplings can be defined by 
specifying the value of a suitable S-matrix element. 

A possibility that is much used in discussions of renormalization theory is 
the BPH or BPHZ scheme (Bogoliubov—Parasiuk—Hepp—Zimmermann), 
otherwise known as zero-momentum subtraction. Let I be a one-particle- 
irreducible (1PI) graph that is divergent, i.e., it has ô(T) >0. The pre- 
scription is that at zero external momentum its renormalized value R(T) 
and its first ô(T`) derivatives with respect to external momentum are zero. 
The BPHZ scheme is to implement this by subtracting off the first ô(T) 
terms in the Taylor expansion of the integrand (Zimmermann (1970)), that 
is, the renormalization is performed before the integration over loop 
momenta. No explicit UV cut-off is needed. In this scheme the self-energy 
already discussed is, at d = 4, 


(BPHZ) 19 |44,)  — 1 = 
2 iR a 327+ fa ir an m?)((p + k)? = m°)] (k? _ m7)? \ (3.4.6) 


while at d = 6, we have: 


5 (BPHZ) eee Se a a a 
IR 12876 (k? — m?)[(p +k)? —m?] (k? — m? + ie)? 
2p'k [4p-k? — (k? — m?)p? ] 


$ (k? cA m?) (k? — m?7)* (3.4.7) 


3.5 Dimensional regularization 


In our initial treatment of UV divergences we used the lattice as a cut-off, or 
regulator. However, what we are really interested in is the renormalized 
theory with no cut-off. We could equally well use some other kind of 
regulator. For example, a Pauli—Villars type of cut-off is achieved by a 
higher derivative term in the Lagrangian. For example, from 


Z =0A?/2—m?A2/2 — [ (O + m?)A]?/AM? — m?) 
— gA?/3! + counterterms (3.5.1) 
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we obtain the free propagator (2.4.1). When the cut-off M goes to infinity, 
the propagator is the ordinary one, but when p? is much bigger than M7?, it 
is smaller by a factor p?/M?. Thus UV divergences are cut-off for the theory 
in six or fewer space-time dimensions; the divergences reappear when we 
take the limit M > oo. 

If we defined the theory by a functional integral the lattice would appear 
as an intermediate step, but the a > 0 limit would give no divergences, if M is 
finite. Although the Euclidean Green’s functions for the cut-off theory 
(3.5.1) exist, the Minkowski space field theory is not physical. A symptom of 
this is that the pole of the free propagator at p? = M? has the wrong sign of 
residue; it implies a particle with negative metric. 

A theory with no cut-off can be obtained by adding counterterms with 
appropriate M-dependences to cancel the divergences and then taking the 
M > œ limit. As an example we showed that counterterms cancelled the 
divergences of the one-loop self-energy graph. Although we assumed a 
lattice regulator, we used no properties of the lattice propagator that are 
not true for the Pauli—Villars case. We assumed only that: 


(1) Ifthe cut-off is taken away (1.e., M —> œ or a > 0) with p and mfixed, then 
the propagator goes to i/(p? — m°). 

(2) If p?— œ with fixed cut-off, then the propagator is sufficiently much 
smaller than 1/p* that the graph is not UV divergent. 

(3) In the Euclidean region there are no propagator poles. 


In principle, any method of imposing a UV cut-off is good enough, but in 
practice some methods are more convenient than others. For most 
purposes dimensional regularization is the most convenient. The method 
starts from the observation that UV divergences are eliminated by going to 
a small enough space-time dimension d. We can use the space-time 
dimension as a regulator provided we treat d as a continuous variable (so 
that the cut-off can be removed by taking the limit d > 4). This idea has a 
long history, but its popularity roughly started after the papers by Wilson 
(1973) in statistical mechanics and by ’t Hooft & Veltman (1972a), Bollini & 
Giambiagi (1972), Ashmore (1972), and Cicuta & Montaldi (1972) in field 
theory (especially non-abelian gauge theories). 

Since vector spaces of non-integer dimension do not exist as such, it is not 
obvious that the concept has any consistency, let alone validity, even in a 
purely formal sense. This we will remedy in the next chapter. For the present 
we will assume uniqueness and existence, and apply standard manipu- 
lations to the integral 


omoes iie ee ee 
apa iae 2(2n) re — m? + ie)[(p +k)? — m? + ie] A 
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until it is of a form where we can sensibly assign a value. We will (following 
Wilson (1973)) express (3.5.2) in terms of a standard Gaussian integral: 


far exp (k?) = i fao far 1kexp(— w? — k?). (3.5.3) 


It is sensible to give this the value ix”, which is correct if d is an integer. In 
the following calculation of the value of (3.5.2) with non-integer d, the 
assumed properties of the integration are italicized. All the manipulations 
are valid for any integer value of d for which the integral converges. 

We use the Schwinger representation for each propagator: 


1/(m? — k? — ie) = | a exp[ — a(m* — k? — ie) ]. (3.5.4) 


Observe that because of the Wick rotation we treat k? as negative. Then we 
it the order of integration to obtain 


=|" da db [akexp[ = (a + bym + bp? + 2bp-k + (a + b)k?]. 
(3.5.5) 


= = 4 


We shift k" by an amount p”b/(a + b) and change variables to z =a + b, 
x =a/z to get: 


ig? 1 œ 
£L =——— | d dzz |d*k — 2 — p*x(1 — a, 
i se. a z| exp { — z[m* — p*x(1 — x)] + zk? } 


After _— k by a factor z'/* we find that 


l, ax |" dzz'~“/* exp { — z[m? — p*x(1 — x) ]} |d*kexp(k7). 
(3.5.6) 


It is this stage which brings in the dimensionality d. We have now reduced 
the d-dimensional integral to the form (3.5.3), which we defined to be ix”. 
The z-integral in (3.5.6) gives a I’-function, so we finally obtain: 


1 = 20 


g? 


Z= Ir A am | dx[m? — p?x(1 — x)]”27?. (3.5.7) 


This result is unique (except possibly for an overall normalization, which is 
universal — the same for every d-dimensional integral). The divergences 
now reside in the I -function which has simple poles at d = 4, 6, 8,.... The 
residue of each pole is a polynomial in p of degree equal to the degree of 
divergence. 

One of the main advantages of dimensional regularization isimmediately 
apparent. Not only was the integral unchanged from its form in a theory in 
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integer dimensional space-time with no cut-off, but the method of 
calculation was unchanged. Use of the representation (3.5.4) is an efficient 
way of obtaining a parametric representation like (3.5.7) for a Feynman 
graph. 

A second advantage that we will see later is that it preserves not only 
Poincaré invariance in the regulated theory, but also gauge symmetries. 
This was a main motivation for its use by °t Hooft & Veltman (1972a) and 
many others. Most methods of introducing a cut-off fail in this respect. (For 
example, gauge invariance is preserved on the lattice but full Poincaré 
invariance is lost.) 

A third advantage — also of great importance in practice —is that a 
continuous space-time dimension is also a gauge-invariant cut-off for infra- 
red divergences in theories with massless fields (Gastmans & Meuldermans 
(1973), Gastmans, Verwaest & Meuldermans (1976), and Marciano & 
Sirlin (1975)). A trivial example, but without any gauge invariance, is given 
by the ¢° self-energy with m=0 at d =2. It is 


1g d2 1 

8n? (k? + ie) [(p + k}? + ie]’ 
which is divergent at k = 0 and at p= — k. The divergence is regulated by 
increasing d. Care is required in using this method if UV divergences are 
present in the same graph, for they are regulated by reducing d. 


3.6 Minimal subtraction 


3.6.1 Definition 


From the unrenormalized self-energy (3.5.7) we compute the renormalized 
self-energy £,, at d=4 by adding a mass counterterm 6m7(g,m7,d) and 
then letting d->4. Suppose we choose m to be the physical mass. Then 


Ei = x1 (p*, Mn» g, 4) -È (Mons Mons J, d) 
g? 


~ (41 a aaf dx {[m2, — p?x(1 — x)]4/2>? 


—[m3,(1 — x + x?) ]”? 3}. (3.6.1) 
Now T(z) has a pole at z=0: 


F(z) = 1/z — yp + 0O (2), (3.6.2) 
where yẹ =0.5772...1s Euler’s constant. So at d = 4 
2 1 m? 2 
g hoP x(1 = x) 
R= ON a 3.6.3 
IR 3272 | m mi (l — x+ x?) } ( ) 


in agreement with our earlier calculation. 
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Since the divergence in 2, amounts to a simple pole at d = 4, a rather 
obvious way of renormalizing it is to define dm? to cancel just the 
singularity, 1.e., the pole (t Hooft (1973)). This, of course, means that we are 
changing our renormalization prescription. By referring back to (3.5.7), we 
see that dm? in this scheme is: 


dm? = (g2/32n7)/(2 — d/2), (3.6.4) 


from which we get Xp by expanding 2, in a power series in d — 4. We find 


2 1 2 2 
g m* — p“ x(1 — x) 
2M saf ax} nf eea | $ yet (3.6.5) 


Unfortunately this contains the logarithm of a dimensional quantity. The 
reason is that in the expansion in powers of d — 4 we did not allow for the 
fact that g has a dimension dependent on d. Therefore we implicitly 
introduced a mass scale. 

To make this scale explicit, we rewrite the coupling 


gaye ig, (3.6.6) 


where we have introduced a parameter u with the dimensions of mass, 
called the unit of mass (t Hooft (1973)). The redefined coupling g now has 
fixed dimension equal to 1, and the renormalized self-energy becomes 


2 1 2 2 
g m” — p*x(1 — x) 
MS) — OX (lal foes a 3.6.7 
IR 327? |. a| Arp |+ ref ( ) 


We derived this by observing that 
2-d/2 _ ẹ(2-4/2)lInų 
= 1 + (2 — d/2)ln u +4(2 — d/2}} ln ?u ++.. 
This renormalization prescription, where counterterms are pure poles at 
the physical value of d, is called minimal subtraction (MS). 

The unit of mass u is entirely arbitrary. Thus the self-energy (3.6.7) now 
depends on three parameters instead of two. However a change of u 
amounts to a change of renormalization prescription, so the change can be 
compensated, in this case, by a change in m. In effect minimal subtraction is 
a one-parameter family of renormalization prescriptions. 


u 


3.6.2 d=6 
We can also apply minimal subtraction to the six-dimensional theory. 
There we define 
ôm? = poles at d=6, 
6z = poles at d = 6, (3.6.8) 
ôg = u? “/*(poles at d = 6). 
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Note that the renormalized coupling is now dimensionless. Since 
I(z— 1)= — 1/z + yg — 1 + O(2) (3.6.9) 
as z => 0, we find that 


dm — p/6)(yg — 1) 


IR 12873 
— p*x(1 — x) 
-f dx[m? — p°x(1 -Jin ™ amir orc d (3.6.10) 
and 
g m? 

aan 4 3.6.11 
OM Cand | ae 
gen al 4 0194 (3.6.12) 

exin dp V3 - 


The counterterm ôg for the coupling can also be calculated. From the graph 
of Fig. 3.6.1, we find (Macfarlane & Woo (1974)) 


3 


g 
= pT l P) 3.6.13 
T ae + 000") (3.6.13) 


Fig. 3.6.1. One-loop vertex graph in ¢° theory. 


3.6.3 Renormalization group and minimal subtraction 


When we discuss the renormalization group in Chapter 7, we will focus on 
one particular subgroup. The transformations in this subgroup consist of 
multiplying u by a factor and making compensating changes in the 
renormalized coupling and mass. As a group it is trivial— being a 
representation of the group of positive real numbers under multiplication. 
What is non-trivial is the way in which it is represented in relation to the 
parametrization of the theory by a renormalized coupling and mass. 

The renormalization group can be exploited in calculating high-energy 
behavior. While a full treatment will be made in Chapter 7, the basic idea 
can be seen by examining the one-loop self-energy. Let p° get large (with m 
and u fixed), and consider the propagator defined using minimal sub- 
traction at d=6: 


E g? — p? 
G~ tat — lel + const. | + o} ) (3.6.14) 
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To be able to make use of a perturbation expansion we must keep higher- 
order corrections small. But this is not so in (3.6.14) if p? is too large. The 
large correction can be avoided by setting x? to be of order | p° |. The theory 
is unchanged if we make suitable changes in g, in m, and in the scale of the 
field. We will learn how to do this in Chapter 7, with the result that the large 
corrections are effectively moved from higher-order terms in the per- 
turbation series to the lowest-order graphs. 


3.6.4 Massless theories 


Let us return (for simplicity) to the self-energy of the four-dimensional 
theory. Consider the limit m— 0. If we use mass-shell subtraction, we have 
(3.6.3), which diverges as m3, — 0. 
` The divergence is an artifact of the mass-shell scheme, for which 
2 


g B t E 
ômĉ, = a ? — d/2)m$; i dx(1 — x + x? j”??? 


2 
= ee E “ii — In(m2,) + finite (as m,,, > 0,d > a| (3.6.15) 
In addition to the pole needed to cancel the UV divergence, there is a 
In (m2,,) term. 

Physically what happens is that in a massless theory there are long-range 
forces. These mean that separated particles are never completely free of 
each others’ influence. Thus, for example, the singularity in the propagator 
is not a simple pole, for the self-energy (with MS subtraction) is, from (3.6.5), 


g? -= p’ 
ane N 9 |. 3.6.1 
sal (ie we J ae 


The mass-shell renormalization prescription relies on the assumption of a 
simple propagator pole to generate counterterms, so it must fail. However, 
the nature of the propagator’s singularity is an infra-red problem, so it is 
irrelevant to the question of whether an ultra-violet divergence can be 
renormalized. Some other renormalization scheme, like minimal sub- 
traction, must be used in the massless theory. 


3.7 Coordinate space 


A good way to understand the infinite renormalizations is to work in 
coordinate space, as was emphasized by Bogoliubov & Shirkov (1980). This 
point of view is especially useful in treating field theories at finite 
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temperature or on a curved space-time background, as we will see in 
Chapter 11. There we will see why the counterterms are the same as at zero 
temperature in flat space-time. 

For most ordinary calculations, it is cumbersome to work in coordinate 
space, because the propagator S,(x) for a free massive field is a Bessel 
function. In momentum space the propagator is simple: i/(p? — m? + ie). 
However the asymptotic behavior of S,(x) as x +0 is simple. 

We have 

T'(d/2 — 1) 


= Gna xt + less singular as x > 0. (3.7.1) 


The one-loop correction to the propagator G,(x, y) is 


G22 


— (9°/2) | d“z | d wSp(x — z)Sp(z — w)’ Sp(w — y) 


[ate fasws — z)® (z — w)S,;(w — y), (3.7.2) 


where $, (z — w) is the self-energy in coordinate space: 
Ê (z — w) = — (g?/2)S,(z — w}. (3.7.3) 


This is singular at z = w, and causes a logarithmic divergence in (3.7.2), 
where we integrate over all z and w. The fact that the divergence is from the 
region z ~ w means that it is in fact a 6-function: 


G22 = [azsa — z)Sp(z — y) 


df am 59 | at + finite. (3.7.4) 
w~0O 


By Wick rotating the w®°-integral and using the following result (next 
chapter) for the d-dimensional integral of a Lorentz-invariant function 


| dêwf (w?) = — i[2n4/2/T(d/2)] | dia TT (3.7.5) 
(0) 


we find 


ig? 


—— ~ + finit 3.7.6 
wrda E ( ) 


G22 = [atesets — z)Sp(z — y) 
as d— 4. 
Evidently, the divergence is cancelled by adding a 5-function to £, : 
ig’ 
E E ee 
R 1 167?(4 — d) 


Mm 


5(w — 2), (3.7.7) 
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which is exactly the mass counterterm computed earlier by momentum- 
space methods. 

The important point, which is in fact true for an arbitrary graph, is that 
any UV divergence comes from a region in coordinate space where several 
interactions occur very close to each other. The divergence can then be 
cancelled by a counterterm which is a 6-function in the positions of these 
interactions. If the divergence is worse than logarithmic, then the counter- 
term will include derivatives of the 6-function. In any event the fact that it is 
a 6-function means that the counterterm can be included as a local 
interaction in the action. The locality means that it is a product of fields at 
the same point. 

Since the singularity at x = 0 of the free propagator S,(x) is independent 
of the boundary conditions used to define it, we should expect, for example, 
that the counterterms used in thermal field theory are the same as at zero 
temperature. At non-zero temperature, the vacuum is replaced by a mixed © 
state, and the boundary conditions for S,(x) change. The momentum-space 
proof that the counterterms are temperature-independent is therefore made 
difficult, but the coordinate-space proof is unchanged. 
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Dimensional regularization 


We have seen how convenient it is to regulate the UV divergences of 
perturbation theory by continuation in the dimension of space-time. To 
date, no-one has shown how to use the method in the complete theory. But 
in perturbation theory, as we will now demonstrate, it 1s consistent and 
well-defined. Now all results obtained by this method can be obtained by 
other, more physical methods (say, a lattice regulator). But frequently much 
more labor is involved. This is not a triviality, for in complicated situations, 
especially in gauge theories, it enables us to handle the technicalities of 
renormalization in a simple way. 

The idea of dimensional continuation has been used for a long time in 
statistical mechanics (see, for example, Fisher & Gaunt (1964)). It became 
very prominent when Wilson & Fisher (1972) discovered the e-expansion 
and applied it to field-theoretic methods in statistical mechanics (Wilson 
(1973), Mack (1972), and Wilson & Kogut (1974)). In the e-expansion one 
works in 4 — e spatial dimensions, and expands in powers of e. At the same 
time, in a purely field-theoretic context, a need arose to find a way of 
regulating non-abelian gauge theories that preserved gauge invariance and 
Poincaré invariance. This led to dimensional regularization (t Hooft & 
Veltman (1972a), Bollini & Giambiagi (1972), Cicuta & Montaldi (1972), 
and Ashmore (1972)). Speer & Westwater (1971) had actually discovered 
the method earlier, but their paper is considerably more abstract, and had 
not attracted much attention. 

Now vector spaces either have infinite dimension or a finite integer 
dimension. So the concept of integration on a space of finite non-integer 
dimension, d, cannot be taken completely literally. Either it is a set of purely 
formal rules for obtaining answers or it is an operation that is not literally 
integration in d dimensions, but only behaves in many respects as if it were 
integration in d dimensions. It is not sufficient to treat it only as a set of 
formal rules (even though that is what it becomes in practice), because one 
must know that the rules are consistent with one another and with the 
algebraic manipulations one carries out on integrals. To show that no 
inconsistencies can arise, we must construct an explicit definition. 
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There are three issues to address: (1) uniqueness, (2) existence, and (3) 
properties. Uniqueness is necessary, to avoid the possibility of constructing 
two definitions, each definition being self-consistent but giving different 
results from the other definition. Existence, shown by construction of an 
explicit definition, is necessary to prove that no inconsistencies arise. Once 
having seen that integration in non-integer dimension can be defined, we 
cannot just assume that all properties associated with ordinary integration 
are true; indeed they need not be. 

So we also have to prove those properties which we need and which are 
true. We also must prove that the results agree with ordinary integration if d 
is an integer. 

These considerations are quite non-trivial, as can be seen by considering, 
for example, the anomaly in the Ward identity for the axial current 
its) = Wy"ysy in the gauge model (2.11.7). If the fermion masses are zero, 
then a naive application of the fermion equations of motion shows that the 
current is conserved: ô j;, = 0. In fact, the current is not conserved, as 
shown by Adler (1969, 1970) and Bell & Jackiw (1969). A counterterm can 
be added to j{;, to make it conserved, but only at the expense of removing its 
gauge invariance. 

Among the objects to be extended to d dimensions are the Dirac matrices 
(y* and y.). If we assumed the obvious generalization of their anticom- 
mutation relations, then for all values of d we would have 


py} = 2g" 71. 
Ys. 7"f =9, (4.0.1) 
ys = 1. 


But then we would be able to derive the false result that the anomaly for the 
gauge-invariant axial current is zero. So there has to be an inconsistency 
(t Hooft & Veltman (1972a)). More complicated problems in a similar vein 
arise when treating supersymmetric theories (Jones & LeVeille (1982)). 

In this chapter we will start by stating the axioms for d-dimensional 
integration given by Wilson (1973). These are sufficient to prove unique- 
ness. Our calculation of a one-loop graph in Section 3.5 was in fact a 
realization of the uniqueness proof for one particular integral. Then we will 
construct an explicit definition of d-dimensional integration. The vector 
space on which we work is in fact infinite dimensional. 

Unfortunately, the definition gives a divergent result in most cases, so we 
will next have to find a powerful enough extension (Section 4.2). We then 
prove some standard properties (Section 4.3). One particular result in- 
volves finding a definition of the metric tensor on an infinite-dimensional 
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space such that its trace is d rather than infinity. 

Then we will be in a position to derive some useful formulae (Sections 4.4 
and 4.5) for use in Feynman graph calculations. Finally we will show how to 
define Dirac matrices; this is obviously important if we are to be able to 
calculate consistently graphs containing the Adler—Bell—Jackiw anomaly. 

The utility of a precise definition such as we give is that if inconsistencies 
arise at some stage, then one can always go back to first principles to 
discover the error. 


4.1 Definition and axioms 


Let d be a complex number. We wish to define an operation that we may 
regard as integration over a d-dimensional space: 


{eter (p). (4.1.1) 


Here f(p) is any given function of a vector p, which is in the d-dimensional 
space. We will suppose that the space is Euclidean. (Minkowski space is 
regarded as a one-dimensional time together with a (d — 1)-dimensional 
Euclidean space.) Following Wilson (1973) we will give an explicit 
definition in which the space is actually infinite dimensional; it is the 
integration operation that gives the dimensionality. Making d a positive 
integer n will effectively insert a 6-function in the integration that will force 
all vectors involved in defining the function f(p) to lie in some n- 
dimensional subspace. 

What properties must we impose on a functional of f in order to regard it 
as d-dimensional integration? The following properties or axioms (due to 
Wilson (1973)) are natural and are necessary in applications to Feynman 
graphs: 


(1) Linearity: For any complex numbers a and b 


| d“p[af(p) + bg(p)] =a | d“pf(p) +b | d“pg(p). (4.1.2) 


(2) Scaling: For any number s 


| d“pf(sp)=s~4 | d'p f (p). (4.1.3) 


(3) Translation invariance: For any vector q 


[ores (p+q)= | d'p f (p). (4.1.4) 


We will also require rotational covariance of our results. 
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Linearity is true of any integration, while translation and rotation 
invariance are basic properties of a Euclidean space, and the scaling 
property embodies the d-dimensionality. 

Not only are the above three axioms necessary, but they also ensure that 
integration is unique, aside from an overall normalization (Wilson (1973)). 
In fact, they determine the usual integration measure in an integer- 
dimensional space (again up to normalization). The proof is simple: 

Use linearity to expand f (p) in terms of a set of basis functions. Choose a 
basis such as the functions 


fs aP) = exp [ — s*(p+4q)*]. (4.1.5) 
Then the integral of a basis function can be written in terms of the integral of 
one single function: 


| d'Pf. a (pPy=s * | d“pexp(—p’). (4.1.6) 


The integral of this one function sets the normalization. It is natural to 
require that the value be the usual one in integer dimensions and that we 
can write 


[apasa exp(— p? —q’)= {ot +42k exp(—k?). (4.1.7) 
Thus the normalization is given by 
[apexi — p?) = 7t. (4.1.8) 


An abstract uniqueness theorem is not sufficient for us. We also need an 
explicit formula so that a d-dimensional integral can be written as a 
sequence of ordinary integrals. This will be important in allowing us to 
prove standard properties of the integration. In addition it ensures that 
there exists a self-consistent definition. It is a priori possible that no 
consistent definition exists; the uniqueness theorem only applies if the 
integration operation exists. 

A function f (p) that we integrate could in principle be any function of the 
components of its vector argument. However, we do not, a priori, know the 
meaning of the components of, say, a vector in 3.99 dimensions. We will 
soon see that there are in fact infinitely many components. In practice, we 
will work with rotationally covariant functions. So we will assume that f is 
a tensor function of a finite set of vectors: p, q,,..., qy say. For example, a 
scalar function is a function only of scalar products 


f =f (P, Pq). (4.1.9) 
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Thus f is in fact an ordinary function of scalar numbers, rather than some 
more complicated kind of function. Of course the values of the scalar 
products lie in restricted ranges. Thus: 


q, 20, 


[qaa] S424. (4.1.10) 


A tensor function is obtained by writing explicit tensors in terms of the 
vectors p, q,,...,@, and of the metric tensor 6‘/, with scalar coefficients. For 
example, we might have 

J P,a) = q'p’f,(P*, P-4.4°) + 6'"f,(P*, P’4, 9°). (4.1.11) 

Such functions are the most general that we need to consider. (We will see 
later how to handle the antisymmetric tensor ¢ and the Dirac y- 
matrices.) 

To give a realization of the objects p,q,,..., we assume that they are 
vectors in an ordinary vector space. The space must be infinite dimensional, 
as we will show in a moment. So we define the vectors each to be an infinite 
sequence of components, p=(p’,p’,...), just as we can define a three- 
dimensional vector V as a sequence of three components (V1, V7, V3). The 
metric is given by: 


KALY 


Pq=pqt+pq +: 

The reason for the infinite dimensionality is that an integral with, say, 
d = 3.99 can be used not only as a regulator for a physical theory in a space- 
time of dimension d, = 4, but also as a regulator for a model theory in any 
higher dimension, e.g., dọ = 5 or 6or.... The vectors q,,q,,...in (4.1.9) can 
be thought of as momenta of external particles, and our vector space must 
be large enough to accommodate d, linearly independent momenta. Since 
dọ is arbitrary, we are forced to infinite dimension. 

To define the d-dimensional integral of a scalar function, we find a finite- 
dimensional subspace containing all the q,’s. Then we write p as a 
component p, in this space and an orthogonal component p;: 

P=P, + Pr 


J 
= Ý ple; +p. (4.1.12) 
j=1 
The ‘parallel space’, in which lie the q ;’s, is spanned by an orthonormal basis 
e, (with j = 1,...,J). We define the integral over p to be the ordinary J- 
dimensional integral over p, performed after integration in d — J dimen- 
sions Over p+: 


ferro = [apap [apso (4.1.13) 
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Since f(p) does not depend on the direction of p, we now can define 


ep. = Kas | dpp? "~+ f(p). (4.1.14) 
(0) 


Here K, (with v=d-— J) is effectively the area of the surface of a 
hypersphere in v dimensions. The value of K, is obtained by considering the 
special case where f is chosen to be a Gaussian — see (4.1.8) — with the result 
2K 
» T(v/2) 
Hence we have a definition of d-dimensional integration in terms of 
ordinary integration: 


| d'p f (p) = 


(4.1.15) 


InG- D2 
I'((d — J)/2) 
We must check that the result is independent of the choice of the subspace of 
the p,. We must extend the definition to handle the divergences at p;= 0 
when d is small, which we will do in Section 4.2. Then in Section 4.3 we will 
prove important properties of our definition. But first there are a couple of 
details to clear up. 

The J-dimensional subspace of p,’s is chosen subject only to the 
requirement that it include all q,’s. So it is possible to extend the space to 


include extra dimensions. To show this has no effect on the value of the 
integral we must prove 


aia Pi |. dp,p$~ 7 *f(p). (4.1.16) 


K, Í dpp’'ate")= | aK, | dpp} °g(p? +k?) (4.1.17) 
(0) (0) 


— œ 
for any function, g, which depends on a scalar argument. This equation is 
true since the right-hand side is 


2r” -12 ix v-1 o| a o= x) (4 1 18) 
ae N = ages, ai 
rg (Cn eae 


where p? = xp” and k? =(1 — x)p’. 

To show that different choices of the ‘parallel’ subspace have no effect on 
the value of the integral, we merely extend both spaces to a common larger 
space. The sole problem is that there may be a divergence in (4.1.18) at 
x = 0; this we will cover by Section 4.2. 

Up till now we have supposed f(p) is a scalar function. If it is a tensor 
fï (p), we work component-by-component. To define, say, the 
component 


{eres '2(p), 
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we take the parallel space to include the 1- and 2-directions and any vectors 
q; on which f‘/ depends. Then we proceed as before. 
For example, suppose f‘/(p) = p'p’g(p), where g is a scalar function. Then 


| f*?(p)d*p = | dp'dp? | d*~?p,p'p7g[(p')? + (p*)* + pz] =0, 
while | 


fars 11(p?)= | dp fe oria + p?) 


— © 


2n! 


~ dT(d/2) 


| dpp**'g(p’) 
(0) 


1 
z | d“pp’g(p’). 
Generalizing this result, we see that 


7 sii 
| d“pp'p/g(p*) = T | dpp? g(p?). 


More general cases are treated in Section 4.3. 


4.2 Continuation to small d 


The convergence of the definition (4.1.16) is d-dependent at p+ =O and 
pr = ©. It improves at p, = œ when d gets smaller, but it improves at p; = 0 
when d gets bigger. Even for a function that decreases exponentially at large 
p, and that is analytic for finite p, the defining integral has a divergence if the 
transverse space has a dimension d — J < 0; this is forced to happen if d is 
negative or zero. So our first task in this section is to find an explicit formula 
for the continuation of (4.1.16) to arbitrarily negative d. We will see that the 
p+-integral has poles whenever (d — J)/2is zero or a negative integer, but that 
these are cancelled by the zeros in 1/T((d — J)/2). 

We will then be able to adopt the resulting formula as a definition of the 
d-dimensional integral of a function for which (4.1.16) converges for no 
value of d. An examp'e of such a function is 


1 

ID = a 
(q, +p)’ + (q2 + p)? +m? 

The parallel space must be at least two-dimensional to accommodate q, and 

q, SO we may set J = 2. Then the transverse integral converges at p; = 0 

only if d>J=2, while the complete integral converges at p= œ only 


ifd <2. 
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We will have a definition that defines fars (p) for all small enough 


d. For larger values of d we define the integral by analytic continuation. In 
general there will be ultra-violet poles at certain values of d — just as in the 
Feynman graph we computed in Sections 3.5 and 3.6. 

To explicitly define the continuation to small d, it is sufficient to consider 
a function f (p°). Let us assume that f—> 0 rapidly enough as p> œ that 
d/2 


2n i 
d’ 2) = —— | dpp*” 'f(p? 4.2.1 
| pI) Tam |, °?? (p°) (4.2.1) 
converges at p > œ for some positive value of d. We also assume that f (p?) is 
analytic at p = 0. Then (4.2.1) converges and is analytic in d for some range 
0 < Red < dmax We define the integral for all other values of d by analytic 
continuation in d. Explicit formulae for the continuation to smaller d’s are 
constructed by adding and subtracting the leading behavior at p— 0. For 
example, the following formula gives the integral in the range — 2 < Red 
E 
Igi! ? 20 
d a 2 Ke d d— i ( 2) 
fa pf (p?) al pp’ f (Pp 


C 
C 
+| dpp°™'[ f(e) —fO)] florea} (4.2.2) 
0 
This is independent of the arbitrary constant C. 


When — 2 < Re d <0 we may let C — œ to obtain 


2n4/ 


dê oN 
| p/ (p°) Ta) 
while at d = 0 the zero in 1/[T(d/2) is cancelled by the pole term to give 


| dpp* *[f(p*) —f0)], (4.2.3) 
0 


|as (p*) = (0). (4.2.4) 


We extend this procedure to continue to — 2] — 2 < Red < — 2l for any 
positive integer l: 


Dri? oO 
| dpsed) =E | pp { £07) -SO -PSO 
= (POO /1!}, 
| d-™pf(p?) =(— 0) FO) (Saa 


This equation gives us the integral when — 2] — 2 < Re d< — 2l on the 
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assumption that the original formula (4.2.1) converges when d 1s just greater 
than zero. Suppose now that (4.2.1) diverges at p = œ for all positive values 
of d, but that fis power behaved as p > oo. Then it is sensible to adopt (4.2.5) 
as the definition of the integral. This particular definition is very important 
since we will use dimensional continuation to regulate Feynman graphs 
that are ultra-violet divergent at d = 4. The definition (4.1.16) applied to a 
Feynman graph frequently has a negative number d—4 of transverse 
dimensions in order to ensure ultra-violet convergence of the complete 
integral. Then we may apply the definition (4.2.5) to the p;-integral with d 
replaced by d- J. 

Another obstacle to continuation in d is sometimes that f(p’) is not 
analytic at p? = 0 but has a power-law singularity there. We may generalize 
the derivation of (4.2.5) to write down a formula for the continuation of the 
integral. 

An example of the use of (4.2.5) as a definition is given by choosing 


f(p*) = (p? + A)/(p? + B), 


where A and Bare numbers. The definition (4.2.1) diverges for all d, but with 
l = 1, (4.2.5) gives us a definition valid for — 2 < Re d <Q: 


d/2 © . 2 
dé Bo | dpp -eP tA A 
[env aal, Ppp a B I 


The integral can be explicitly computed to give: 


| d“p(p* + A)/(p* + B) = (2B)*(A/B — 1)T(1 — 4/2), 


which can be continued to all d. 
Suppose f has a power-law singularity, as for example 


1 
I= PFPP +m) 
The definition (4.1.16) of the integral of this function converges if 2 < d < 4. 
To continue it to lower d we must subtract the power behavior at p= — q, 


just as we did for singularities at p; = 0, or at p = 0 in (4.2.2). Then we can 
define the integral of say 


2 


eE ear 
(P +q} (p? + m?) 


One result of all these definitions is that the integral of a power of p is 
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Zero: 


| d4p(p?)* =0 (4.2.6) 


for any value of « (integer or not). It should not be thought that there is any 
choice in (4.2.6). It follows from (a) the explicit continuation of (4.1.16) to 
small d, and (b) application of the continued formula as a definition of 
[d*p(p*)*. 

Consistency of the formalism also requires (4.2.6). For suppose that f (p) 
= (p’)*/(p? + m°). Then when — 2a—2<d < — 2a we have 


| d'p f (p?) = | d’p[ f (p?) — (p?)*/m? J. 


If linearity is to be true then we have (4.2.6). 

We could subtract out the singularity differently, by a function that is not 
just a power of p. But then, for example, the simplification obtained in (4.2.2) 
by taking C —> œ would no longer occur. 

Observe that if in the first of our definitions (4.1.16), we take f to be a 
positive-definite function, then the integral is positive. But when the integral 
is continued away from the region where this definition converges, then the 
subtraction terms mean that the integrand is no longer positive definite, so 
that the integral need not be positive. 

At the end of Section 4.1, we proved that the value of a d-dimensional 
integral does not depend on how we split the integral into an ordinary 
integral over some integer-dimensional ‘parallel space’ and a spherically 
symmetric integral over the remaining dimensions. We let J be the 
dimension of the parallel space. Then the proof consists of examining what 
happens when J is increased by one. Ultimately we had to prove (4.1.17), 
which is a property of ordinary integrals. We assumed d > J, so that there 
were no subtraction terms. To generalize the result to the case that d — J is 
not positive, we must prove that 


x [J 2 —d/2] 
Ky | dpe") fod- X f(Oyp int | 
(0) n=0 
al ak | appt? fa + p2) 
— x (0) 
[(J + 1—d)/2] 
= > J caer: | (4.2.7) 
n=O 


Here the symbol [a | denotes the largest integer smaller than a. To prove the 
equation we change variables on the right-hand side to x and p, where 
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pt = xp? and k? =(1 — x)p”. For the right-hand side we get 


1 œ 
kaa] dxxť7773¥2(1 = f dpp*- 7? x 


(0) 0 


[(J + 1—d)/2] 
x fd- YL fd - spot 
n=0 
1 oO 
=iK. 14 dxx@-J- 327] — x)" 1/2 | dpp y a x 
0 J 0 


[J/2 —d/2] 
« {| 10 - y fone /n | 


n=0 
[(J + 1 —d)/2] 


2n [(J — d)/2] jronvn,2j 
= |a- x)p*)— — = Py fY0 ot td A dá |} 


n=0 n!(n —j)! 


(4.2.8) 


Here we have added and subtracted 
[(J —d)/2] 


2 LO Opn, 


so that the integral over p? of p’~/~? times each square bracket term is 
convergent. After scaling p° by (1 — x), we get: 


1 oe) 
Hai] dx x@- I~ 3/2 (4 -a| dp PI TIam x 
(0) (0) 


[J/2 —d/2] 
x {| so- y fop: | 


n=0 
[(J + 1 —d)/2] x”(1 = xr —d/2 Hyen 


n=0 n! 


{(J —d)/2] 
(n)( p2 (j) 


j=n 


Integration by parts in the p-integral gives 


1 
Ky-s-1| dx x67321 — x) 1/4 x 


0 


[O + 1-0/2] (1 + J/2 —d/2)! 
x [ = eee EAE a | x 


ng] — xW/2-d/2-n 
ee ni(1 + J/2—d/2—n)! 


m [J/2 —d/2] 
x | app so p f ont | 
(0) 
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An integration by parts on the x-integral is used to show that it equals 
P(1/2)r[ (ad — J — 1)/2]/T[ (d — J)/2], from which the required result 
follows. 


4.3 Properties 


Property 1. Axioms: The definitions (4.1.16) and (4.2.5) satisfy Wilson’s 
axioms (4.1.2), etc., for d-dimensional integration. 

Proof. We reduced d-dimensional integration to ordinary integration so 
linearity follows from linearity of ordinary integration. We must choose the 
P|, space to be large enough that it is the same for both functions f and g in 
(4.1.2). Our explicit continuation (4.2.5) to arbitrary negative d ensures that 
reducing the dimension of the transverse space is no problem. 

Scaling and rotation covariance are explicit properties of all our 
definitions. 

Translation invariance is valid for ordinary integration, so it follows from 
definition (4.1.16) provided the p, space is big enough to include the vector q 
used in the axiom (4.1.4). 


Property 2. 
ie (p?) L ald å+ 2a- 26 I(x + d/2)T (8 — « — d/2) 
(p? + M°) P(d/2)1(B) 
Proof. Immediate from (4.2.5). Note that this implies that the integral of a 
power of p? is zero, since I'(f) ~ 1/B as 8—0. 


(4.3.1) 


Property 2a. 


[aro =0. (4.3.1a) 
Proof. Already done. 
Property 3. 
O | a a 0 
= |d°P/(p,q,...)= [dP S.t). (4.3.2) 
oq oq 


Proof. Contract with a vector ôq which projects out the derivative with 
respect to the component of q in the ôq direction. Then make the parallel 
space (of p,’s) big enough to include ôq and use (4.3.2) on ordinary integrals. 
This is true for all dq. 
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Property 4. 
646; ; =d. (4.3.3) 


Proof and definition of ô; ,. Now 6'/ is defined to be the component form of 
a contravariant tensor with 6 = 1 if i = j and zero otherwise. The obvious 
definition of the covariant tensor 6, j is as the inverse matrix, i.e., the same 
thing. This gives 65,,= œ. However in an infinite-dimensional space, 
there is space for a different definition. 

A contravariant tensor may be defined by specifying its components. But 
a covariant tensor w is fundamentally a linear function acting on covariant 
tensors: w(T).We can write w(T)= œ;; TĦ only if the sum converges. 

We need the covariant ô (which we symbolize by 6,,) to be rotation 
invariant, and to give 6(T)= T" whenever the sum exists. We would also like 
contraction with ô;; to commute with integration. For example 


Ô; j | d“pp'p’ f (p°) = ô;; | d“po"p* f(p*)/d 
Ôi; [avviso = [amro 


Since we have an infinite sum, we cannot immediately apply linearity to 
prove this equation. 
Let us define 


dV (d/2 y 
ô(T) = au ) d'p ¥ T‘ip,p,5(p? — 1). (4.3.5) 


i,j 


Whenever )' 7" converges, this definition gives 
6(T) = y Tï 
But if the sum diverges, then it is possible to get a finite value for ô(T). In 
particular, 
dI (d/2) 


gil? 


(6!) = dipp? ap? — 1)=4, 


as required. The definition is rotationally invariant. Commutation of 
contraction with ô; and integration will now be a consequence of 


commutativity of two integrals — which we will prove later. 


Property 5. Integration by parts: 


| dp ôf (p)/dp' = 0. (4.3.6) 
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Proof. Work component-by-component. Contract with an arbitrary 
vector k: 


0 
dni 


Then, to define this integral, we must put k in the parallel space, and we can 
use the proof of (4.3.6) for ordinary integration in the space parallel to k. 


Property 6. To define integration over two (or more) variables: 
[ipak sk;a,, ss -qy) 


we must choose to calculate one integral then the other, according to the 
rules already stated. 

For this definition to be sensible we need the result to be independent of 
the order of integration: | 


| dp | d*k f= | dík | dp f. (4.3.7) 


We could also allow the dimensions of the p- and k-integrals to be 
different. Then exchange of order of integration {d*p{d“k— {d*k{d*p. 
is allowed only if d =d’, or if f is independent of p-k. 

Proof. It is sufficient to consider the case that there are no q,’s, so that 
f=f(p*,p-k,k’). (If there are q,’s, then we take out a finite-dimensional 
integral for both k and p which spans all q,’s and then we apply the theorem 
to the remaining dimensions.) 

The left-hand side of (4.3.7) is 


4ni 1/2 © x a 
ee E a d-—1,d-2 2 2 2 
aera | es | KA a a et E ah 

(4.3.7L) 


while the right-hand side is 


Art 1/2 W a a re 
areas, ak (an, | dprpy 7k" * f(p? + pt, pik, k?) 
(4.3.7R) 


Here k, is the component of k parallel to p, while p, is the component of p 
parallel to k. Change variables to, say, p*, k?, and z=p-k/(pk)= 


D,/ </(p? + p2) =k, / ./(k? + k?), with the result that both (4.3.7L) and 
(4.3.7R) are equal to 


Ani 1/2 e0) 


ea eas ene ee d maa d {= 2\(d — 3)/2 zA kz, k?). 
ADA- Jo a ae a a, 


(4.3.7C) 
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The theorem is thus proved in the case that (4.3.7L) and (4.3.7R) are both 
convergent. Note that it is not a trivial consequence of the corresponding 
result for integer-dimensional integration. 

If the dimensions of the integrations are not the same, then let the k 
integral have dimension d’. The left-hand side gives 


Aner ee? me ‘a re 2\(d’ — 3)/2 

as | dy’! | dk | dz(1 — z^) ~ "7 
TADE = 1/2) Jo P” | o z 

which in general is not the same as the corresponding expression for the 

right-hand side. But if f is independent of z, then the z-integral can be 

computed explicitly. The result is 


| d*kd“pf(p*,k’) = oe) dp p** | i dkk”! f(p?,k?). 
i I(d/2)T(d'/2)] o 0 
(4.3.8) 


A problem is that if d is not positive, we must make subtractions as in 
(4.2.5). These are clearly asymmetric between the two orders (4.3.7L) and 
(4.3.7R) of performing the original integral (with now d’ = d); in practice, d 
will be the number of dimensions transverse to the external vectors 
d.>---»Qy- In applications to Feynman graphs d will therefore be negative in 
order to regulate UV divergences. So we must use (4.2.5) to define the 
integrals. Then (4.3.7) does not give (4.3.7L), (4.3.7R) and (4.3.7C). 

We solve this problem by defining an auxiliary integral with a 
convergence factor, say 


I(a,d) = | dp | dk f (p,k) exp [ — a(p? + k2)]. (4.3.9) 


Assume f is power-behaved at infinity. Then for all d, (4.3.7L or R) is UV 
convergent. Moreover, if d>1 then both (4.3.7L) and (4.3.7R) are IR 
convergent without subtractions. The function I(a, d) is analytic in a and d. 
Continue down to small enough d that (4.3.7) is UV convergent. Then 
I(a,d) is given both by (4.3.7L) and (4.3.7R) with f replaced by 
f exp[ — a(p? + k’)], and with subtractions made. Now set a= 0 to prove 
the theorem (4.3.7). 


Property 7. 


for ferns (p? + k?) = farea f (q?). ~ (4.3.10) 


Proof. Since f is independent of p-k, the previous theorem shows that the 
left-hand side is independent of the order of integration, even if the 
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dimensions of the p- and k-integrals are different. Then use (4.3.9) and 
change variables to q = (p° + k’)'/* and x = p?/q’. 


Property 8. 
l 0 if t is odd 
dy nit... nit 2) _ : ; 
fa pp": -p*g(p") l Ti 4 [g], if t is even, (4.3.11) 
with 
Tot = [ditt26 i344. Oi tye. 
+ all permutations of the i’s ]/t!, (4.3.12) 
and 
_T@/2)0(t/2 + 1/2) | aa anya pr 
2n’ T(t/2 + 1/2) (> 
a ei a (t/ /2) dp p’*'~ 1g(p?). (4.3.13) 


~ T(1/2)0(d/2 + t/2) J o 


Proof. If t is odd, antisymmetry of the integrand under p— — p makes 
the integral over the ‘parallel’ space zero. 
= Antisymmetry under reversal of one component of p, symmetry under 
permutations of the i’s, and rotation invariance give the general form 
(4.3.11) and (4.3.12). Computation of one component (say, i, =i, =: = 
i, = 1) then gives (4.3.13). 


Examples. 
| d“pp'p’g(p*) = 4/05" fapa, (4.3.14) 
staid SiiGkl + SikGit 4 Silgik . 
jar Pip Ke) = a d‘p|p|*g(p?). (4.3.15) 


Property 9. Consider an integral 


(Pis... P) = {tp P2) (4.3.16) 


which is UV convergent by power-counting at d = 4; that is f = O(1/k**%) as 
k goes to infinity in any direction, for some positive number a. Then the 
integral is analytic in d and in the parameters p,, when d is close to four, if the 
integrand is analytic. If the p,’s lie in the first four dimensions, then the 
integral at d=4 has the same value as the ordinary four-dimensional 
integral of f. 
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Example Suppose f has the form 
f= (k,p,,--.,p,)exp(— Ak? + 2Ak-p), (4.3.17) 


where for any vector v we let V be its projection onto the first four 
dimensions and let ¥ be its projection onto the remaining dimensions. Then 
we let I be the ordinary four-dimensional integral of f. By use of our 
definitions of d-dimensional integration, we have 


[= [ores 
= Ta *kexp — Ak? + 2Ap-k) 


= I(n/A)"/?~ * exp(Ap?). (4.3.18) 
This is manifestly analytic in d and p. If we set d = 4, the integral becomes 
I = Iexp(Ap’). 


Notice that there is no restriction on f, even though p=O in four 
dimensions. However, if we let d—4 and p-—0, the limit is smooth. 

Proof of Property 9. The proof is easily made by examining the definition 
of the d-dimensional integral in terms of ordinary integrals. As usual we 
divide the space into a finite-dimensional parallel space big enough to 
contain p,,...,p,, and into a transverse space containing the remaining 
dimensions. It is convenient to choose the parallel space to have an odd 
number 2N + 1 of dimensions. Then: 


I is dk É d E ER as k2\d- 3v2-N 
= vr kanyi | dk FS 
L ' i wN, A i 


N-2 

x Fi, kysis konek = S Fk ses kN „okèn | (4.3.19) 
n=0 

Here we have used F to denote f considered as a function of the first 2N + 1 

components of k and of k*. Since f is an analytic function of k, it can be 

expanded in powers of k?. 

Asrequired by the definition, we have subtracted off a power series in k+,to 
give convergence at k = 0. We use F to denote the nth derivative of F with 
respect to k2. The integrand of the k? integral behaves as k? °, so we have 
convergence at k,=0 if d>3. Since f is analytic, there are no other 
singularities at finite k, and the only other possible source of a divergence is 
from large k. The subtractions do not introduce a divergence provided that 
d<5. Moreover, we have assumed that f= O(1/k***) as k— œ, so that 
there are no other large k divergences when d is close to four. Hence (4.3.19) 
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converges and is analytic in a neighborhood of d= 4. 

Now the integral will depend on the p,’s only through the Lorentz scalars 
P,P, (with 1 <a <b < J). To determine this dependence, it is sufficient to 
keep the p,’s within some fixed J-dimensional subspace. Since (4.3.19) is a 
perfectly finite integral, it is an analytic function of the p,’s. 

To determine the value of the integral when d = 4 and when the p,’s are in 
the first four dimensions, we use the freedom to vary the dimension of the 
‘parallel’ space in the definition (4.3.19). Let us now make it four 
dimensional. We will obtain an integral of the form: 

œ D724! 2:=2 00 


= aoak en kki- 5f „ka, ka, k2) (4.3.20 
' | dey... dk amy | , SFR Pkn kas kas kas R?) (43.20) 


if d>4. When we let d—4, the integral over k is singular at k = 0; the 
resulting divergence cancels the zero of the inverse T -function to give 


I(d = 4) = | dk dk, dk,dk, f(k, kz, k3, k4, 0), (4.3.21) 


as required. 
We may alternatively continue from d < 4 using 


Igi? o PeR aS T 
I= (aby... dass dkk [f (ki, Ka, R?) — (kip... k4,0)]. 


(4.3.22) 


The singularity at k = 0 is cancelled, but as d— 4 we get a divergence at 
k = œ which gives the same result (4.3.21). 


Comment In this proof we used the freedom to alter the dimension of the 
parallel space. To show that the integral is well-behaved at d = 4, it was 
convenient to choose the parallel space to have an odd dimension. But to 
compute the actual value at d = 4, it was convenient to choose the parallel 
space to have an even dimension, specifically, four. It is instructive to see the 
equivalence in a simple non-trivial case. (The general case was summarized 
at the end of Section 4.2.) 
Suppose 3 < d < 4. Then define 


d/2-2 


n E E Se 
I, = dk (k p= k,,k k? 4.3.23 
= Fara |, EEO Plas kas kas kash) (4.3.23) 


=|) po * dk2(k2)4- PL f(ley,..-5 kay k? + 2) 
: rd@-5)Jo T i 
—f(ky,---skasks)]- (4.3.24) 


— œ 
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Our definition of the d-dimensional integral of f tells us that 


I= | dk, ...dk,I, = [áki dka» (4.3.25) 


so we must prove that J, = L. 
To do this we change variables in I,, by setting k2 = xk? and k? = 
(1 — x)k? to obtain: 
md - 5)/2 1 oo 
| ne d fe2(e2)4/2- 30] — y) 1/2 4/2- 7/2 
> “aaa. | a (k*) (1 — x) x x 
x Lf (ys 025g, k?) — f (Ky, .- -3k43 k? — x))] 


gd - 5/2 1 co 
Be eee I Oe dk2k4-©(1 — xy) 1/2 x 4/2 - 7/2 
rans) l «| eee: 


x {Lf (ky 5-25 kak?) —S (kis... Kq,0)] 
—[f(k,,..., ka, k2(1 — x))— f (kis... k4,0)]}. (4.3.26) 


In the last line we subtracted and added f(k,,...,k,,0), so that we can 
integrate seperately each term in square brackets. In particular, we have 


[ARRES kaska A — x)) —f(k,,...,k4,0)] 


=(1 — P42 CAE —f(k,,...,k4,0)]. 


Comparison with the definition (4.3.23) of I, shows that 


I, m7 '?P(d/2—2) f 
— a mamama = — 1/2 .d/2—7/2 
I, Y(d/2—5/2) ad x) x 
— x%2-7/2(1 — x)3/2-42] (4.3.27) 


The integral is in fact the analytic continuation from d>5 of a beta- 
function, so that it equals ['(d/2 — 5/2) (1/2)/T(d/2 — 2). The required 
result 7, = Z, follows. 


Property 10. Multiple integrals are correct at d = 4. 
Consider the integral 


10s Py) = [hyd fkn- ki Pir: Pa) (4.3.28) 


This might represent a Feynman graph with N external lines and L loops. 
Then the p,’s and k,’s represent momentum vectors. Suppose that at d = 4 
the integral is completely convergent — in particular that there are no ultra- 
violet (k > œ) divergences or subdivergences. If we restrict the p,’s to the 
first four dimensions and set d = 4, then I is the ordinary four-dimensional 
integral of f. 
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Proof. For each vector v, we define the projections ¥ and v, onto the 
physical and unphysical dimensions, as before. The result to be proved is 
that if p; = P; for each of the p,’s then J defined as the limit as d > 4 of the 
dimensionally regulated integral is identical to the ordinary integral. We 
can split each integral over a k; into an ordinary four-dimensional integral 
over k, and a (d — 4)-dimensional integral over k,. The result to be proved is 
then that 


lim fat“, ...d?-*k, f(kK,,...,K.,p,---> Py) =S(K,,...,K,, Pis... By) 
(4.3.29) 


as d— 4. 
This formula is proved by doing all but the integral over k,. Let the result 
be I: 
(1)° 


Lok) = | d?-4k, ...d1-4k; f; (4.3.30) 


its only dependence on K, is via its length. We then have that the left-hand 
side of (4.3.29) is 


forty = l „(0)= [ted f(b Kya Barissa 


by use of the Property 9. Notice that the dependence on k; is on its first four 
dimensions. We can then repeat this process to show that 


IlO) = [asd fa a ay) 


Another L— 2 repetitions give (4.3.29), from which the desired property 
follows. 


4.4 Formulae for Minkowski space 


In this section we derive a collection of results that are useful for Feynman 
graph calculations. 


4.4.1 Schwinger parameters 
To convert an arbitrary graph in d dimensions to a parametric integral, we 
first rewrite each propagator using 


1 1 


BR rg] apla] 4D 


Then we perform the momentum integrals. Since all Feynman graphs are of 
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the form of a polynomial in momenta times a product of simple scalar 
propagators, we only have to calculate d-dimensional integrals of the form: 


Į" --#n( A, B) = fase _..kmexp [ —(— Ak? —2B-k)]. (4.4.2) 


Here A depends only on the parameters introduced by (4.4.1), while B“ 
depends on these parameters and also linearly on the other momenta (both 
external and loop momenta). 

By linearity we can find I, by differentiating Io: 


"n /1 ð 
pemn J] Ga) {atkexpcai? + 2B-k). (4.4.3) 
Mj 


j=l 
(This uses linearity of d-dimensional integration.) We find J, by using the 
translation k“ > k“ — B“/A, the scaling k—k A` +’, and Wick rotation: 


I,(A, B) = [ikopa +2B:k) 


= i(n/A)*/* exp( — B?/A). (4.4.4) 
Thus 


[t= | dkk" exp(Ak2 + 2B-k) 
= i(n/A)"2 exp( — B?/A)( — B*/A), (4.4.5) 
= fasene exp(Ak? + 2B:k) 
= i(n/ A)" exp (— B?/A)(B“B’/ A? —4g*”/A), (4.4.6) 
pw = | d4k kĉk"k” exp(Ak? + 2B-k) 


> „| — B?B“B* (Bĉg"” + B"g*%™ + B’g*" 
= i(n/A)4/2e-* A = J pore ae (4.4.7) 


| [askre exp (Ak? +2B:k) 


= i(m/A)4/2 e78"/4 B* BB" B” M (B*B*g”” + five similar) 

= i(7/ e J7 m a 
(g**g"" + two similar) 
St gpg (4.4.8) 


Each of the loop-momentum integrals is performed in this way. At each 
stage the momenta only appear quadratically and linearly in the exponent. 


4.5 Dirac matrices 


4.4.2 Feynman parameters 


It is also common to use 
1 
1/(AB) = | dx/[Ax + B(1 — x)]? 
0 


and its generalizations: 


1 T'(a+ B+-:--s) 


— E M X 
A'B- E TAr) Te) 


1 
x [ dxdy---dzd(1 — x — y—---z)x 
JO 


a a ge | 
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(4.4.9) 


(4.4.10) 


Here A, B,...,E represent the denominators of the propagators of a 
Feynman graph. The resulting momentum integrals have the form 


: ke Ss ek!” 
Minn — ee 
Tma [e eae 
Application of (4.4.1) and the results (4.4.4)-(4.4.8), etc., gives 
Jo= [orn — k? —2p-k + C) 
= in"? (C + p°)? -°T (a — d/2)/T (a), 
J! = fas k"/(— k? —2p-k + C)* 
= in"? (C + p?) — pr (a — d/2)/T (a), 
Jv = [ert kk’ /(— k? — 2p-k + C)* 


= int? (C + pyr x 


x [D(a — d/2)p*p’ — T(« — 1 — d/2)g*(C + p*)/2]/T(a). 


4.5 Dirac matrices 
The Dirac matrices satisfy the following properties: 


(1) Anticommutation relation: 
{yy} = 2g""1. 


(4.4.11) 


(4.4.12) 


(4.4.13) 


(4.4.14) 


(4.5.1) 
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(2) Hermiticity: 


ye ifu=O, 
ye =y = 4.5.2 
t Yu j — y“ if u > |. l ) 


When we use dimensional regularization, the Lorentz indices range over an 
infinite set of values, so we need infinite-dimensional matrices to represent 
the algebra (4.5.1). We will also need a trace operation: 


tr 1 =f (d), 


so that the representation behaves as if its dimension were f(d). We must 
require f(d) to be the usual value at the physical space-time dimension, 
d=d,. Usually this means f(4) =4. 

The trace is a linear operation on the matrices which we will define later. 
In an even integer dimension d = 2m, the standard representation of the y”’s 
has dimension 2°. However, it is not necessary to choose f(d) = 2“. The 
variation f (d) — f (d,)is only relevant for a divergent graph, so, by Chapter 
7, any change in f(d) amounts to a renormalization-group transformation. 
It is usually convenient to set f(d) = f(d,) for all d. 

To set up a formalism for dimensionally regularized y-matrices, we must 
treat the following issues: 


(1) We must exhibit a representation of the anticommutation relations; 
this will ensure consistency. 

(2) The formulae for the trace of an arbitrary product of y”’s must be 
derived. 

(3) While a knowledge of the y’s alone is sufficient for QCD and QED, we 
must show how to define a y, so that we can treat chiral symmetries. 
This will also give us a definition of the antisymmetric tensor ¢,,,,. 

The following construction gives a representation: 

Let w be a positive integer, and suppose inductively that we have 
defined a 2° dimensional representation y{,) of the algebra (4.5.1) for 
O<p< 2m — 1. We define the infinite dimensional y” for 0 < u < 2w — 1 by 
having a sequence of ,)’s down the diagonal, and zeros everywhere else: 


(4.5.3) 


We will construct the next higher representation y£, , ,, of dimension 2°* s 
with 0 < u < 2m + 1. In order that (4.5.3) apply independently of w, we must 
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choose 


yt, ,=(2@ ° if0<p<20-1 
Yiot 1) — 0 vi WUSPS+@M— 1. 
(œw) 


This satisfies the anticommutation relations (4.5.1) and the hermiticity 
relation (4.5.2), provided that 0 < u, v < 2m — 1. Our task then is to find 
Yo+1) fOr p= 2m and 2m + 1. 

Notice that the induction starts with œ = 1. We can define 


1 0 0 1 | 
m=( a a 5) (4.5.4) 


Given the 2°-dimensional representation y% we define another matrix 


Tioj = 1°" Ver j via (4.5.5) 
Observe that 
Ta = Tap Vas = l, Var Pog = 0. (4.5.6) 


Also, when at œ = 2, we have f = y,, in the usual notation for Dirac matrices 


at d=4. We define 
yo -( 0 ~ 
(w+ 1) A ’ 
Yo) 0 


Yo. = sy pa (4.5.7) 
Vo 0 

It is easy to check that (4.5.1) and (4.5.2) are satisfied for O< yp, 
v<2w+1. 

We now have an explicit representation of the Dirac matrices for any w, 
and for the infinite-dimensional case, because of (4.5.3). 

Standard manipulations involving the anticommutation relations are 
valid independently of d. Two useful results are: 


yy, =H y1 = gil = dl, (4.5.8) 
Ve Gig a 
= (2 —d)y,. (4.5.9) 


We also need traces of y-matrices, in graphs with fermion loops. The trace of 
a matrix is linear: 


tr (aA + DB) =atr(A) + btr(B), (4.5.10) 
and is cyclic: 
tr (AB) = tr (BA). (4.5.11) 


Here A and Bare any product of y-matrices, and a and b are any numbers. 
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These properties, together with the value of tr 1, define the trace of any 
linear combination of products of y-matrices. 
For example, 


tr(y"y") =tr(y’y")  (cyclicity) 
=tr(— y’y’+2g""l) (anticommutation) 
—tr(y*y’) + 2g’’tr1 (linearity), 


so we have the usual result 
tr(y’y’) = gtr 1. (4.5.12) 
Similarly 
tr (y*y*y"y") = gh — g'tg® + gg") tr 1. (4.5.13) 


The trace of the product of an odd number of y-matrices is zero. For 
example 


dtry* =tr (y*y,y4) 
— tr (y"y*y,) + 2 tr yt 
— tr(y,y*y4) + 2 tr y4, 


so try? =Q. 

It should be possible to make a more constructive definition of the trace, 
along the lines of (4.3.5). It is necessary to check consistency. We can find a 
formula for the trace of any number of y-matrices — generalizing (4.5.13). It 
is true for any finite-dimensional representation, yi», SO it agrees with the 
algebraic properties. Linearity defines the trace of more general products. 
We must also check that contracting with g,,, commutes with the trace. This 
can be checked directly. 

A possible explicit definition of the trace of a matrix with components 
M, j 1S 


ME ee 
trM,,=(tr 1) lim — > M ,;- (4.5.14) 
N> œ j=1 
This definition exploits the fact that each matrix y“ is an infinite set of copies 
of a finite-dimensional yi, strung along the diagonal. Since the y“’s are 
independent of d, the only possible d-dependence is in the choice of the 
value of tr 1. 


4.6 Ys and Exadpy 


In four dimensions, y, =iy°y'y*y* and ¢,,,, is a totally antisymmetric 


Lorentz-invariant tensor with £9123 = 1. We need y,, for example, to define 
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KAuV 


the axial current wy"y,w. The e-tensor comes in because y, = 
ig, aw"y*y"y’/4!, and we have the trace formula: 


tr y?y*y+p4y’ = ie’ tr 1 = — ie, tr 1. 


KAY 
The appropriate definition changes when we go to two dimensions: 


Instead of y, we have #,,, = °y}, and instead of ¢,,,, we have ¢,,, for which 


KAuvV pv? 


Eo, = 1 = — £10; ĉ&00 = £11 =O. 
To continue dimensionally, we might expect y, to satisfy 
{7597} = 0, 
just as in four dimensions. But then, as we will see in Chapter 13, the only 
consistent result for y, is that it has zero trace when multiplied by any string 
of y”’s. Thus we do not have a regularization involving the usual y.. 
A consistent definition is obtained by writing 


Y? = ip yp y?y? = ipy*yy"e ayy /4 4, (4.6.1) 
1 if (kAuv) is an even permutation of (0123), 
fx Auy 


— 1 if (kApy) is an odd permutation of (0123), (4.6.2) 
0 otherwise. 


This definition is not Lorentz invariant on the full space, but only on the 
first four dimensions. We have 


{y5.y"} =0, if w=0,1, 2,3, 
[y;,y"] =0, otherwise, 
(ys) =1, y5 =75. (4.6.3) 


The lack of full Lorentz invariance is a nuisance, but it does give the correct 
axial anomaly (t Hooft & Veltman (1972a)). 
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Renormalization 


In this chapter we come to the general theory of renormalization. The basic 
difficulty is that a graph may not only possess an overall divergence. It may 
have in addition many subdivergences which can be nested or can overlap 
in very complicated ways. Most of our effort must go to disentangling these 
complications. 

We will begin by investigating some simple graphs. These will show us 
how to set up the formalism in the general case. The ultimate result is the 
forest formula of Zimmermann (1969). Contrary to its reputation, this is not 
an esoteric procedure, designed for pedantically rigorous treatments. 
Rather, the forest formula is merely a general way of writing down what is in 
fact the natural and obvious way of extracting the divergences from any 
integral. Its power is demonstrated by the ease of treating overlapping 
divergences, the handling of which is normally considered the béte noire of 
renormalization theory. 

The forest formula is applied to individual Feynman graphs. It extracts 
the finite part of a graph by subtracting its overall divergence and its 
subdivergences. We will, of course, need to show that the subtractions can 
be implemented as actual counterterms in the Lagrangian. We will also 
show that the counterterms are local, i.e., polynomial in momentum. 

An important advantage of using the forest formula to obtain the finite 
part of each graph, rather than working directly with counterterms in the 
Lagrangian, is that the procedure applies to more general situations. As we 
will see in Chapter 6, it will enable us to renormalize composite operators. A 
more important case is the computation of asymptotic behavior as external 
momenta of a Green’s function get large. For this case, the forest formula 
permits a good derivation of Wilson’s operator-product expansion, which 
we will discuss in Chapter 10. 

Let the value of a Feynman graph be written as: 


(GPs Py) = [dka «+ dku HPs Py ikasiko) (5.0.1) 


Here p,,...,Py are the external momenta, and k,,...,k, are the loop 
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momenta. Renormalization is removal of that part of the large-k behavior 
that causes divergences. Moreover, the very same techniques can be used to 
extract the behavior for large p — as we will see when we treat the operator 
product expansion in Chapter 10. 

Although Weinberg’s (1960) theorem tells us the power-counting applic- 
able to either kind of asymptotic behavior, it does not tell us how to 
organize it. In particular it was only much later that Wilson (1969) 
formulated his operator product expansion, which is the important tool in 
computing asymptotic behavior, for example in deep-inelastic scattering — 
see Chapter 14. Many generalizations have been made — see Mueller (1981) 
for a review. These are phenomenologically very important, and the 
method by which they are proved is close to that for Wilson’s expansion. 


5.1 Divergences and subdivergences 


The idea of renormalization theory is that ultra-violet divergences of a 
field theory are to be cancelled by renormalizations of the parameters of the 
theory. We propose to prove this in perturbation theory. The use of 
perturbation theory implies that we expand the counterterms in the action 
in powers of the renormalized coupling, g, thereby generating extra graphs 
with these counterterms as some of the vertices. To avoid superfluous 
technicalities, we will consider the case of ø? theory in six-dimensional 
space-time. 

A very efficient way to understand renormalization was discovered by 
Bogoliubov & Shirkov (1955, 1956, 1980) and Bogoliubov & Parasiuk 
(1957), and we shall follow their approach. The first step is to decompose the 
Lagrangian as follows: 


LE a a i ae gre (5.1.1) 
Here Zo is the free Lagrangian used to generate the free propagator 
ip? — m? + ie) in perturbation theory: 

La =(Co)? /2 — m* 7/2, (5.1.2) 
with m being the renormalized mass. The rest of the Lagrangian, Z, = 
La +t L£.,,istheinteraction,andconsistsoftwoterms. Thefirst, which we will 
call the basic interaction, is 

LPa = 9/3! (5.1.3) 


where g is the renormalized coupling. The second term, Z 
counterterm Lagrangian. 
Consider graphs generated by the basic interaction. These have UV 


we will call the 


ct? 
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divergences which are to be cancelled by graphs with some of their 
interaction vertices taken from the counterterm Lagrangian 


L „= 5Z(8b)?/2 — m? ¢?/2 — 6963/3! — sho. (5.1.4) 


(The term linear in ¢ is needed to cancel tadpole graphs — see Figs. 5.1.4 and 
5.1.5 below.) In order to give meaning to 6Z, dm’, dg, and 6h, we must 
impose an ultra-violet cut-off. We will use dimensional regularization in the 
following sections. 

The key to the method that we use is to realize that each of the three terms 
in the counterterm Lagrangian should not be considered as a single 
quantity. Rather it is to be considered as a sum of terms, each of them 
cancelling the overall divergence in one particular graph generated by the 
basic interaction. For example, the self-energy graph, Fig. 3.1.1, gives a 
contribution ô, Z to 6Z, and a term 6,m? to 6m’. Our calculation of this 
graph led to (3.5.7), so with minimal subtraction we have 


5, Z = 9?/[384n°(d — 6)], 
ô m? = g?m* /[ 6413 (d _ E (5.1.5) 
Then 


ôZ= Y ôsZ=ô, Z+, 
graphs G 
with similar formulae for the other counterterms. 

We saw the utility of this idea by examining graphs like those in Fig. 3.2.1 
and Fig. 3.2.2. Graphs like Fig. 3.2.2 contain vertices corresponding to the 
counterterm 6,m? (and ô, Z). Such graphs are all generated by taking 
graphs like Fig. 3.2.1 with no counterterms and finding where Fig. 3.1.1 
occurs as a subgraph. Substitution of the counterterm for one or more of 
these subgraphs gives the graphs with counterterm vertices. 

This leads to the idea that we consider by itself the renormalization of a 
single graph generated from the basic Lagrangian. We add to it a set of 
counterterm graphs to give a finite result. Only as a separate step do we 
recognize that the counterterm vertices are, in fact, generated from a piece 
of an interaction Lagrangian. 

The graph-by-graph method is probably the most powerful approach to 
understanding not only the problem of ultra-violet divergences but also 
many other problems in asymptotic behavior. Even so, it is not at all trivial 
to ensure that the renormalization program can be carried out. The essential 
steps are: 


(1) To find the regions in the space of loop momenta of a graph that give 
ultra-violet divergences. 
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(2) To show how to generate a series of counterterm graphs for a given 
basic graph. 

(3) To show that the counterterm vertices are local (i.e., polynomial in 
momenta). 

(4) To find the conditions under which the counterterm vertices amount 
only to renormalizations of the parameters of the Lagrangian. 


The complications in carrying out this program arise when one treats the 


case of the divergence of a graph which has a divergent subgraph. To 
understand why there is a difficulty, we will examine the graphs of order g* 


for the full propagator — Figs. 5.1.1—5.1.3. 


-OO +O + —* -O- 4+ 
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Fig. 5.1.1. A two-loop graph for the propagator in ¢° theory, together with its 
counterterm graphs. 
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Fig. 5.1.2. A two-loop graph for the propagator in ġ?° theory, together with its 
counterterm graphs. 
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Fig. 5.1.3. A two-loop graph for the propagator in ¢° theory, together with its 
counterterm graphs. 
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(a 


We ignore the graphs with tadpoles, such as Fig. 5.1.4. These are 
divergent and need a counterterm d6hd. We can use a renormalization 
condition that <0|ġ|0> vanishes. Then the total of the tadpole graphs is 
zero (e.g., Fig. 5.1.5), so we omit any graphs containing them. 

Let us return to the sets of graphs listed in Figs. 5.1.1—5.1.3. In each set 
there is one basic graph and a set of counterterm graphs. Ultra-violet 


a 


Fig. 5.1.4. A tadpole graph, together with its counterterm. 
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Fig. 5.1.5. Graphs to O(g*) for <0|@|0>. 
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divergences involve a loop momentum that gets large, so the divergences 
are always confined to one-particle-irreducible subgraphs. The simplest 
case is Fig. 5.1.1, where the basic graph has two insertions of the one-loop 
self-energy. It is made finite by adding the graphs with one or both of the self- 
energy subgraphs replaced by a counterterm. (We use a cross to denote a 
counterterm ina graph, and weuse the label ‘1’ for the counterterm ofthe one- 
loop self-energy.) 

In Fig. 5.1.2, the basic graph is more complicated. We will treat it in detail 
in Section 5.2, and we merely summarize the results here. It has two UV 
divergences. The first comes from letting both loop momenta k and ! go to 
infinity; we call this the overall divergence. But there is also a divergence 
where the momentum in the outer loop stays finite. This is an example of a 
subdivergence. Its existence, as we will see, implies that there is a term 
proportional to p? ln (p°) in the divergence of the basic self-energy graph. 
This cannot be cancelled by any local counterterm. However there is also a 
graph with a counterterm to the subgraph. This graph is Fig. 5.1.2(b). After 
we add the two graphs, the non-local divergence cancels. The overall 
divergence in the result is then cancelled by a local counterterm, for which 
we use the label ‘2’. We implement this as a counterterm in the action by 
inserting terms ô, Z and 6,m? into the complete counterterms 6Z and 6m’. 

The pattern is simple. We consider as a single finite entity one basic graph 
together with counterterms for its subdivergences and for the overall 
divergence. 

Another case is shown in Fig. 5.1.3. There are two subdivergences, each 
corresponding to a vertex subgraph. The one-loop vertex is logarithmically 
divergent and is made finite by renormalizing the coupling (Fig. 3.6.1). 
Since the two divergent subgraphs overlap, the counterterm graphs are 
generated by replacing one but not both of the vertex graphs by its 
counterterm. The overall divergence is then local and is cancelled by 
counterterms 6,Z and ôm”. 
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Our first task in Section 5.2 will be to verify the above statements. To 
generalize the argument we will then observe that power-counting as in 
Section 3.3 determines the strength of the overall divergence. To prove that 
the presence of subdivergences does not affect the form of the overall 
counterterm, we will differentiate with respect to external momenta to 
remove the overall divergence. Then we will be able to construct an 
inductive proof that if subdivergences have been cancelled by counterterms 
then the overall divergence is local and its strength is determined by simple 
power-counting. 

We will also show how to disentangle the combinatoric problems when 
subdivergences are nested. Finally, we will discuss Weinberg’s theorem. 
This theorem tells us exactly which regions of momentum we must 
consider. In practice one is very simple-minded about locating UV 
divergences. For example, we stated that the regions giving divergences for 
Fig. 5.1.2are:(a) k and l going to infinity together, and (b) | going to infinity, 
with k fixed. In each region, all the momenta get large in a particular 1PI 
subgraph that is divergent by power-counting. Weinberg’s theorem tells us 
that these are the only regions we have to consider explicitly. In the case of 
Fig. 5.1.2 there is another region that is important, where l goes to infinity 
with k also going to infinity, but much more slowly. This region interpolates 
between the other two, but in fact does not need to be treated as a separate 
case. 


5.2 Two-loop self-energy in (¢°), 


In this section we will explain the properties of overall divergences and 
subdivergences by computing the two-loop self-energy graphs, Figs 5.1.2 
and 5.1.3, in ° theory at space-time dimension d = 6. We will again use 
dimensional regularization, and will need the values of the one-loop 
counterterms in order to cancel subdivergences. 

The one-loop self-energy was considered in Section 3.6.2, where we found 
that the counterterms needed were given by (5.1.5). We can also compute 


the one-loop vertex graph, Fig. 3.6.1, with the resulting counterterm being 
(cf., (3.6.13)) 


539 = w3~4293/[64n3(d — 6)]. (5.2.1) 
It is worth noting that this implies a value for the one-loop term in the bare 
coupling: 
Jo = [u° "?g + 53g + O(g°)]Z-°” 
= gu"? {1 +-497/[64n°(d — 6)] + O(g*)}. (5.2.2) 
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5.2.1 Fig. 5.1.2 


In order to be able to compute Fig. 5.1.2 in closed form we work with the 
massless theory. The value of the graph is then 


1 d}-ad l 
<4 ke aE TE TOE Tay 629) 


The inner loop is easily computed in terms of I’-functions: 


1 1 
| dlas = in PT (2 — d/2) | dxf — k?x(1 — x)]”27? 
(k—1 5 


)? 
= in" P T(2 — Ta re z = (—k?)#2-2, (5.2.4) 
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The denominators can be combined by a Feynman parameter: 
1 _ T(5-—d/2) |° y ae 


Len m nM. 5.2.6 
A*~4?2B T(4—-d/2)]o “TAx + BUI — x) J574? RER 
after which the k-integral can be performed. The result is 
2 2 n2 d—6 
_{9 \)P (=P 
*20 = (i) 2 (z) i 
T(2 — d/2)T (5 — d)r (d/2 — 1PT (d — 4) 
T(d — 2)T (4 — d/2)T (3d/2 — 5) 
g? \?p?/ — p2\4~-6 
=| —— } —( —; r(2 — d/2)T (5 — d)K (d). S241 
(£5) “4 (2 —4/2)T(5 — 4) K(d) (5.2.7) 


The overall ultra-violet divergence is contained in the factor T(S — d). 
Observe that the argument of this T -function is exactly minus half times the 
degree of divergence. The subdivergence is contained in the factor 
I'(2 — d/2); this is the same as we calculated in Chapter 3. 

Before we discuss further the UV divergences we should observe that 
there are also infra-red divergences. These come from the existence of long- 
range forces in a theory with massless fields. In momentum space, they 
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appear as divergences when some momenta go to zero. For example, if 
_d<2 the integral over the momentum through any propagator has a 
divergence at zero momentum: 


| d*q/q? ~ constant /(d — 2) at d close to 2. 
q~0 


This accounts for the factor I'(d/2—1)?. When d< 4 there is also a 
divergence at k=0 with l and p fixed. Our only concern is with UV 
problems, so we ignore the IR divergence. If we used a massive field, the IR 
divergences would go away, but we would not have an explicit formula for 
eae 

Now let us expand Ł,, in powers of d — 6 to exhibit its divergences, and 
its dependence on p°: 


g? 2 p2 1 1 — p? 
2, = ee S TET PEE 
Sa (£5) Arent +ga n( 2s) + constant | + 


+41n? p z ] + constant in a + constant + O(d — 6) >.. 
Anu anu 


(5.2.8) 


The double pole at d = 6 and the double logarithm in the finite part are both 
reflections of the fact of having a subdivergence. The p-dependence is a 
power of p° times a polynomial in In( — p°). This is a characteristic feature 
of massless theories. 

The simple pole has a coefficient that is not polynomial in p. 
Consequently, it cannot be cancelled by any local counterterm. It 1s easy to 
see that this is caused by the presence of the subdivergence. The 
subdivergence comes from the region where the loop momentum of the 
inner loop goes to infinity while the momentum k in the outer loop remains 
finite. Integrating over finite k gives a logarithm of p times the divergent part 
of the inner loop. We have already introduced into the Lagrangian a 
counterterm for the inner loop, so that there is a graph, Fig. 5.1.2(b), in 
which this counterterm appears in such a way as to cancel the 
subdiver gence. 

Therefore the sum of Figs. 5.1.2(a) and (b) should have no subdivergence, 
but only an overall divergence. This can be cancelled by a local counterterm 
(1.e., a polynomial in p). We will prove this in Section 5.2.2 by differentiating 
three times with respect to the external momentum p“; this gives a result 
which has negative degree of divergence, i.e., there is no overall divergence. 
Since the subdiver gence is cancelled, there is no subdivergence whatever, so 
the counterterm must be quadratic in p. We represent this by Fig. 5.1.2(c). 
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In Section 5.2.2 we will make explicit this proof of locality of the 
counterterms. 

Here we will verify the above statements by explicit calculations. In our 
case that m= 0, the value of Fig. 5.1.2(b) is 


2,,6-d 2 

g'h k 
Eee e ed k 
ae 1 (27)? | (k?)?(p + k)? 


-( g? ) (Eepe —p d/2 -3 
64r? 6 (d — 6) (d — 2) (=, 


E g? tp =) 1 —p’ 
=(=) sree tl- In z2 + constant 
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(5.2.9) 


The non-local divergence disappears when we add this graph to £,,, with 
the result 


a ks a ee | constant 
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+ In? = + constant In (z5) + constant + O(d — J ; 


(5.2.10) 


The non-local divergence has cancelled, as promised. However, the double 
pole and, in the finite part, the double logarithm have not cancelled, even 
though it is evident from our calculation that they are associated with the 
subdivergence nested inside the overall divergence. This is a general 
phenomenon. Indeed we will see in Chapter 7, where we discuss the 
renormalization group, that the coefficients of the double pole and of the 
double logarithm could have been predicted from the one-loop counter- 
terms without any explicit two-loop calculations. 

Since the non-local divergences have now cancelled, the overall diver- 
gence can be cancelled by choosing a wave-function counterterm 


2 2 
g 1 — 1 constant 
AE E en ig ON 52.11 
2 (>) spt (d—6) eal) 


Then we obtain at d= 6 a finite result, which we term the renormalized 
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value of Fig. 5.1.2: 


TOR = Lag + Lay + (Ly, = — P°52Z) 


2 2 a2 2 2 
g P —P —P 
= (45) 36 f; In? (z) + constant In (z5) + constant L 


(5.2.12) 


5.2.2 Differentiation with respect to external momenta 


We saw that the graph Fig. 5.1.2 has an overall divergence which is local, 
but that it is local only after we have subtracted the subdivergence. In 
general we will need to show that the counterterm of a IPI graph is a 
polynomial in its external momenta with degree equal to the degree of 
divergence. Our argument (following Caswell & Kennedy (1982)) depends 
on differentiating with respect to external momenta. 

In this subsection we will apply the argument to Fig. 5.1.2, emphasizing 
its generality. Then in the next subsection we will apply it to Fig. 5.1.3. Even 
though that graph has an overlapping divergence, traditionally considered 
a hard problem, we will see that our method works as easily for this graph as 
for Fig. 5.1.2. 

We first differentiate Fig. 5.1.2(a) three times with respect to p“, to make 
its degree of divergence negative. We represent the result pictorially by 
Fig. 5.2.1, where each dot indicates one differentiation with respect to p. 
Similarly, differentiating Fig. 5.1.2(b) three times gives Fig. 5.2.2. Now 
Fig. 5.2.2 cancels the subdivergence in Fig. 5.2.1, and there is no overall 
divergence, so their sum is finite. Thus the third derivative of the sum of 
Figs. 5.1.2(a) and (b) is finite. So the overall counterterm is quadratic in p. 
Lorentz invariance forces it to be of the form A(d)p? + B(d). 

We glibly asserted that Fig. 5.2.1 plus Fig. 5.2.2 is finite. This statement is 
not as obvious as it seems. Let us prove it. We Wick-rotate the integrations 
over k and lin Fig. 5.2.1, and consider regions of the integral that might give 
a UV divergence. If k and l go to infinity at the same rate, then there is no 


F ig. 5.2.1. Result of differentiating Fig. 5.2.2. Result of differentiating 
Fig. 5.1.2(a) three times with respect to Fig. 5.1.2(b) three times with respect to 
its external momentum. its external momentum. 
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divergence, because the degree of divergence is negative. If l goes to infinity 
with k fixed, there is a divergence, but it is cancelled by the counterterm 
graph, Fig. 5.2.2. 

The remaining significant possibility is that both k and / go to infinity, but 
that kis much less than l. The ratios of different components of either one of 
k or l are finite, so we may summarize the order of magnitude of the 
contribution from this region as: 


finite | dk ef dll. (5.2.13) 
l>k 


This gives a divergent contribution, if l is of order k*/?. We must add 
Fig. 5.2.2, which, as we will show, cancels this new divergence. Observe that 
the counterterm was arranged to cancel the divergence when l goes to 
infinity with k fixed, rather than when k is large, as we now have. 

Let us expand the inner loop of Fig. 5.2.1 in powers of k when I > k, up to 
its degree of divergence, which is quadratic. The coefficients of these powers 
are integrals over all l, restricted to l > k. The divergences in the coefficients 
are cancelled by Fig. 5.2.2, and we have the following estimates of the sizes 
of the coefficients: 


coefficient of k? = finite {| dip divergence | 
k 


æ k 
= nite} | dll -divergence | + finite | dil 


1 1 
= 0(k°), 
coefficient of k! = O(k), 


k 


coefficient of k? = finite | dl/l — divergence | 


= O(In(k)). (5.2.14) 


The sum of Figs. 5.2.1 and 5.2.2 in the region k > œ, with l possibly much 
bigger than k, is then of order 


| * dk k-2In (). (5.2.15) 


The power of k is the same as is given by the overall degree of divergence, 
but there is an extra logarithm. We get a finite result, as claimed. The higher- 
than-quadratic terms in the expansion of the loop in powers of k give no 
divergence at all. : 
What has happened? The divergence for 1>k*1 could only occur 
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because the interior loop was itself divergent. The fact that we sent k to 
infinity merely suppressed this divergence somewhat. Suppose we neglect k 
in the integral for the inside loop. Then the counterterm is in effect the 
negative of the integral over l of the loop from finite l to infinity. But in the 
region we are considering for Fig. 5.2.1, we are restricting l to be much 
bigger than k, which is itself getting large. So if we neglect k in the loop, then 
we are left with 


finite + | dl (integrand of loop with k neglected). (5.2.16) 
l<k 


Furthermore, we expand the loop in powers of k, to uncover the sub-leading 
divergences. Each extra explicit power of k in the expansion compensates 
for the lowering of the divergence. The quadratic term multiplies fdl/l, so 
giving an extra logarithm (but not a power). 

The key step in the proof is to perform the integral with the larger 
momentum | first. We have shown that, for the purpose of determining 
whether or not a divergence occurs, we need only consider as distinct 
regions: (1) k,!— œ at the same rate, and (2) l— œ with k finite. (We might 
also try k > œ with / finite, but the subgraph with the lines carrying the loop 
momentum k has negative degree of divergence, so we get no divergence 
from that region.) The region k, l— œ, with k <1, is schizoid: it can be 
considered as essentially part of either of the two regions (1) and (2) that we 
_ have just defined. As region (2), the divergence is cancelled by a counterterm 
when | > œ, with k large but fixed. As region (1), the final integral over k is 
finite, and the only sign of this intermediate case is the extra logarithm in the 
integrand. 


§.2.3 Fig. 5.1.3 


We conclude this section by considering the example of Fig. 5.1.3. Atd = 6 
the graph (a) has an overall quadratic divergence. It also has a logarithmic 
subdivergence when either of the loop momenta k or l gets large. The 
subdivergences are cancelled by vertex corrections, which are shown in 
graphs (b) and (c). We must prove that the overall counterterm (d) is 
quadratic in p. 

Conventionally, this graph is regarded as a difficulty in the theory of 
renormalization, for it contains an overlapping divergence. That is to say, 
one of the lines is common to both subdivergences. This is seen as a problem 
(Bjorken & Drell (1966)) if one tries to write the graph as an insertion of a 
renormalized vertex, Fig. 5.2.3, in the one-loop self-energy. The graph (b) 
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Fig. 5.2.4. Result of differentiating Fig. 5.1.3 three times with respect to its external 
momentum. 


for the counterterm to one of the subdivergences is not of this form. The 
corresponding difficulty does not happen in our first example, Fig. 5.1.2. 

However, our trick of differentiating three times with respect to p works 
as well for Fig. 5.1.3 as it did for Fig. 5.1.2. For the sum of (a), (b), and (c), we 
find Fig. 5.2.4. The point is that differentiating either of the subgraphs 
makes it convergent, while the counterterms for the subdivergences are 
independent of momenta. We get terms (a) and (b), which have re- 
normalized subgraphs, and graphs (c) and (d), which have no sub- 
divergences at all. None of the graphs has an overall divergence. The 
calculation of the overall counterterms is left as an exercise for the reader. 
The correct result is (Macfarlane & Woo (1974)): 


[ENII 1 1 1 
sz =( i) Ee Haa) 


g Y 1 1 1 
54m? “(is) ml — oar a) (5.2.17) 


if we use minimal subtraction. 


5.3 Renormalization of Feynman graphs 101 


5.3 Renormalization of Feynman graphs 


We have seen that, in order to construct a sensible (1.e., local) counterterm 
for a(1 PI) Feynman graph, one must first subtract off its subdivergences. This 
is natural since the subtractions for subdivergences are automatically 
generated from having the counterterms be definite pieces of the interaction 
Lagrangian. Without subtraction of the subdivergences, the divergence of a 
graph need not be local. It may even have a power of momentum greater 
than the degree of divergence; an obvious case of this is a graph that is finite 
according to naive power-counting but that has a subdivergence. 

It is therefore useful to devise a procedure for starting with a basic 
Feynman graph G, constructing a set of counterterm graphs, and thereby 
obtaining a finite renormalized value R(G): 


R(G) = U(G) + S(G). (5.3.1) 


Here U(G) is the ‘unrenormalized’ value of the basic graph (which diverges 
as the UV cut-off is removed), and S(G) is the subtraction — the sum of the 
counterterm graphs. 

The strategy we use to construct S(G) is very general. It applies to the 
asymptotic behavior of any integral as one or more parameters approach a 
limiting value. In field theory it can be applied not only to the re- 
normalization problem but also to the calculation of the asymptotic 
behavior of a Green’s function as some but not all of its external momenta 
get large. (A standard example which we will treat in Chapter 10 is the 
operator product expansion of Wilson (1969)). 

The procedure that we use for renormalization was first developed by 
Bogoliubov and Parasiuk (see Bogoliubov & Shirkov (1980)), with 
corrections due to Hepp (1966). Their construction was recursive and has 
the acronym BPH. Zimmermann (1969) showed how to solve the 
recursion — the result being called the forest formula. All these authors used 
zero-momentum subtractions. Zimmermann (1970, 1973a) showed more- 
over that there is then no need to use an explicit UV cut-off. He applied 
the algorithm for computing R(G) directly to the integrand rather than to 
the integral; this formulation has the title BPHZ. It is not necessary to use 
zero-momentum subtractions. For example Speer (1974), Collins (1975b), 
Breitenlohner & Maison (1977a, b, c) showed how to make the same ideas 
work using minimal subtraction. 

Our treatment will aim at showing the underlying simplicity of the 
methods and their power to demystify renormalization theory. We will see 
that the methods do not depend on use of a particular renormalization 
prescription, even though we will often use minimal subtraction. 
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We will examine the structure of the subtractions for a graph G. (A graph 
we define by specifying its set of vertices and lines, each line joining two 
vertices and each vertex attached to at least one line.) We write the graph’s 
unrenormalized value as 


Uc(Pis- -s Px) = [eth atk Lolo Pikpak (53:2) 


Here we let L be the number of loops and N be the number of vertices. The 
external momenta at the vertices are p;. In a Feynman graph for a Green’s 
function there is an external momentum at the vertices for the external 
fields, but at an interaction vertex, we have p, = 0. 


5.3.1 One-particle-irreducible graph with no subdivergences 


The simplest case is a one-particle-irreducible (1PI) graph with no 
subdivergences. Then the only possible divergence is an overall divergence 
where the momenta on all the lines get large simultaneously. We may 
renormalize the graph by subtracting an overall counterterm: 


R(G) = U(G) — T°U(G). (5.3.3) 


Here T denotes some operation that extracts the divergence of U(G). It 
implements whatever renormalization prescription that we choose to use. 
For example, we might use minimal subtraction. In that case T takes the 
Laurent expansion of U(G) about d = d}, and picks out the pole terms. (We 
let the physical space-time dimension be d, ;1.e., dọ = 4 for the real world, or 
d, = 6 for the f° model we used in the previous sections.) We will use either 
of two notations for the action of T on an unrenormalized object: T°U(G) 
or T(G). Both will mean the same. 

We could use zero-momentum subtractions. In that case T picks out the 
terms up to order 6(G) in the Taylor expansion of U(G) about zero 
momentum. Here 6(G) is, as usual, the degree of divergence. There are many 
other possibilities. In our work, we will use the minimal subtraction scheme. 

Then, for example, the one-loop self energy in (¢°), gives 
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The one-loop vertex, Fig. 3.6.1, gives 
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Observe that in this last case we define the pole to come with a factor 
u? 4/2, This is an example of a general rule that one must define the pole 
part of U(G) to have the same dimension as U(G), for all d. 


(5.3.5) 


5.3.2 General case 


In general we not only have to handle the case of an overall divergence, but 
also the case that subdivergences are nested within the overall divergences. 
Another case is exemplified by the propagator with two self-energy 
insertions (Fig. 5.1.1), where within one graph there are two subgraphs 
which can diverge independently. 

As we saw from examples, we must subtract off subdivergences before 
finding the overall divergence. In view of the complications when the 
subdivergences themselves have subdivergences, etc. (ad nauseam), we must 
be extremely precise as to what is to be done. This is what we will now do. It 
is helpful to have a specific non-trivial example in mind, to make sense of the 
mathematics. Such examples are treated in subsection 5.3.3 and in 
Section 5.4. The reader should try to read these sections concurrently with 
the general treatment in this section. 

First, let us define a specific divergence as being the divergence occurring 
when the loop momenta on a certain set of lines get big, with the momenta 
on other lines and the external momenta staying finite. Whether or not a 
given set of lines has a divergence associated with it is determined by power- 
counting. A divergence is thus associated with a certain subgraph. (At this 
stage, we do not require that the subgraph be connected.) 

Ifa graph G diverges when all its lines get large loop momenta, it is said to 
have an overall divergence. A one-particle-reducible graph (like any of 
Fig. 3.2.1) cannot have an overall divergence — some lines are not a part of 
any loop. All other divergences involve a smaller subset of the lines. They, of 
course, are called subdivergences. Every subdivergence of a graph is the 
overall divergence of one of its subgraphs. 
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Observe that Fig. 5.1.1(a) has no overall divergence, but has three 
subdivergences. These come from the regions in the integration over loop 
momenta where: (1) the left-hand loop has large momentum, (2) the right- 
hand loop has large momentum, and (3) both loops have large momenta. 

To renormalize a graph G we assume that we know how to renormalize 
its subdivergences, and we then let R(G) be the unrenormalized value of G 
with subtractions made to cancel the subdivergences. Then the only 
remaining divergence that is possible is an overall divergence. So we define 
an overall counterterm: 

C(G) = — T-R(G) (5.3.6) 


by applying to R(G) the same subtraction operator T as we discussed 
earlier; if there is no overall divergence (e.g., if G is one-particle-reducible) 
then C(G) is zero. In any event the renormalized value of G is defined as 
R(G) = R(G) + C(G). (5.3.7) 
The definitions (5.3.6) and (5.3.7) give us R(G) provided that we know how 
to subtract subdivergences. This is essentially a matter of renormalizing 
smaller graphs; we will construct R(G) in a moment. Once we have done 
this, we will have a recursive definition of R(G): successive application of 
(5.3.7) to smaller and smaller subgraphs ultimately brings us to graphs with 
no subdivergences. These we know how to renormalize. | 
Now let us define R(G), which is to be U(G) with subdivergences 
subtracted. For the case of a graph with no subdivergences we must define 


R(G) = U(G) (if G has no subdivergences). (5.3.8a) 
For a larger graph we define 
R(G)=U(G)+ Y CG). (5.3.8b) 
GG 


We sum over all subgraphs y of G, other than G itself, as indicated by the 
notation y ¢ G. The other new notation C,(G) means that we replace the 
subgraph y by its overall counterterm, as defined by (5.3.6), i.e., 


l (5.3.9) 


—ToR(y), if y has an overall divergence 
Chy) = 
0 if y has no overall divergence 


To make a simple formula, we write the sum as being over all y’s rather than 
only over divergent y’s; then the C,(G)is zero if y is not overall divergent. We 
could of course restrict the sum only to those subgraphs that have an 
overall divergence. 

One tricky point in the above equations arises in defining C(y) for a 
disconnected subgraph y. An example is the subgraph of Fig. 5.1.1(a) 
consisting of the two self-energy loops. We will discuss this next. 
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Equations (5.3.6) to (5.3.9) give a definition of R(G). Let us see how they 
apply to simple examples. For a 1 PI graph with no subdivergences they just 
reproduce R(G) = U(G) — T» U(G). 

Next consider a graph like Fig. 3.2.1(a) or (c), whose only divergence is a 
subgraph with no further subdivergences. Then there is no overall 
divergence, so by (5.3.7) 


R(G) = R(G). (5.3.10) 
There is only one subdivergence, so (5.3.8) collapses to give 
R(G) = U(G) + CG), (5.3.11) 


where y is the divergent subgraph. Here C,(G) is the full graph with y 
replaced by — ToU(y). We reproduce the obvious result. There is one 
counterterm graph like Fig. 3.2.2(a) or (c). 

We now look at a graph with two or more divergent subgraphs which do 
not intersect and which have no subdivergences. It is sufficient to consider G 
to be Fig. 5.1.1(a). There is no overall divergence, so again R(G) = R(G). Let 
y, and y, be the self-energy bubbles. Then the subdivergences correspond to 
the three subgraphs y,, y,, and y; Uy. (Here y; Uy, means, as usual, the 
union of y, and y,.) So 


R(G) = R(G) = U(G) + Ci (G)+C,(GQ+C G). (5.3.12) 


yiu oa 


Evidently C, (G) is just U(G) with y, replaced by its counterterm, 
— T- U(y,); and similarly for y,. But what is C, uy, (G)? 

It corresponds to a subtraction for y, Uy, for the region where all loop 
momenta are large. But we must subtract from it the counterterms for the 
regions where only one momentum is large: 


CY yo) = — Te [U(y)U(y2) + Cy JU(y2) + UY )C(y2) J. (5.3.13) 
Here we used the fact that y, Uy, is disconnected, so that 
U(y, U Y2) = Ul(y,)U(y2). (5.3.14) 


To work out (5.3.13), we must define T when acting on a disconnected graph 
to act independently on its components. Thus: 


To[U(y )U(y2) ] = [Te Uy) ] LT? U2) ] = Cy C2), (5.3.15) 
To [C(y,)U(y2)] = [Te C) J[Te U(y2) ] = — CC), (5.3.16) 


etc. 
We used the property that Te U(y,) = — C(y,). Furthermore, Te[ Ts U(y;) ] = 
T- U(y,), i.e., the pole part of a pole part is itself. We therefore find 
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that 
Cy; Uy) = [Te U(y,) ] [Te U(y,) ], (5.3.17) 


so that we reproduce the counterterm graph Fig. 5.1.1(d). 
The above procedure generalizes to an arbitrary graph. It may appear 
excessively complicated, but it allows the smoothest way of defining R(G). 
Finally, we observe that our definitions (5.3.6)—(5.3.10) exactly reproduce 
our results for the two-loop graphs like Figs. 5.1.2 and 5.1.3. 


5.3.4 Summary 


In this section we have proved very little. We have set up a series of 
definitions that state exactly what we mean by the renormalization of a 
Feynman graph. The notation we have introduced will be important in 
making proofs. What we will need to prove is that the overall counterterms 
are local and of a degree in momentum given by naive power-counting. We 
will also show how to solve the recursion to find an explicit formula due to 
Zimmermann (1969). 


5.4 Three-loop example 


The three-loop self energy graph of Fig. 5.4.1 in ¢° theory in six dimensions 
is an example of a graph with nested and multiply overlapping divergences. 
We call it G. Its divergent subgraphs are: 


yı = {lines carrying loop momentum k}, 

y, = {lines carrying loop momentum q}, 

y, = {lines carrying loop momenta k and/or J}, 

y4 = {lines carrying loop momenta q and/or J}, 

Ys = Y1 U Y2- (5.4.1) 
The first four of these are connected 1PI vertex graphs; the last is a set of 
two unconnected vertex graphs. According to our definitions of Section 5.3 
we have 


R(G) = R(G) + C(G) 
= R(G) — T-[Ř(G)]. (5.4.2) 


pt+l 


Fig. 5.4.1. Three-loop self-energy graph in ¢° theory. 
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This equation states that we first subtract subdivergences to obtain R(G), 
and then take off the overall divergence. 


To define R(G) we subtract ee 


R(G)=G+ 2 C, (G) 


i=1 


=G+)G| (5.4.3) 
z> CO) 
We represent this as Fig. 5.4.2. The notation with the vertical bar in 
this equation denotes that we take G and replace y; by the corresponding 
counterterm C(y,). In the figure, the labels 1,...,4 signify which of the 
subgraphs y,,...,y, has been replaced by its counterterm. 


~ <] >-<D 
ORO 
O-O 


Fig. 5.4.2. Subtraction of subdivergences of Fig. 5.4.1. 


Only y, and y, have no further subdivergences, so C(y,) and C(y,) are the 
ordinary one-loop counterterms. But we have still to define C(y,) for i = 3, 4, 
5: 

C(y3) = — To[ y; — 73 eerie 

C(y4) = — To [y4 E Palaro: | 

C(ys) = — Te[y1 y2 — TO )?2 — 71 T(72)] | 
=[-T)JL-T2)]. (5.4.4) 


The overall result is obtained by combining (5.4.2)-(5.4.4). If we represent 
the effect of applying T to a 1PI graph by enclosing it in a box, we can write 
R(G) as shown in Fig. 5.4.3. There are sixteen terms in all. The first eight 
represent U(G) minus its subdivergences, and the last eight form the 
subtraction for the overall divergence. The expansion of R(G) represented in 
Fig. 5.4.3 is an example of the forest formula, to be discussed in the next 
section. 

As we saw by examining two-loop graphs, the overall counterterm for a 
graph is non-local unless we first subtract off subdivergences. Otherwise 
there would be divergent contributions from where some but not all 
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| [Previous | 
— | 

| (8 graphs) | 

È ee es 


Fig. 5.4.3. Renormalization of Fig. 5.4.1. 


subgraphs have large loop momenta. There would also be divergent 
contributions from the region where all loop momenta get large but at 
different rates. We can check from Fig. 5.4.3 that none of these problems 
occur for R(G). Let us do this explicitly. 

Let us show that the overall counterterm for G is local. We differentiate 
R(G) three times with respect to the external momentum p and show that 
the result is finite. Given the momentum routing of Fig. 5.4.1, there are three 
lines to differentiate: p + k, p + l, p + q. Here we have used the momentum 
carried by the line as a label for the line. Differentiating the original graph 
gives ten terms, where the three derivatives are applied to any combination 
of the three lines. One term is where we differentiate p + k three times (Fig. 
5.4.4). Although there is then no overall divergence, there remain sub- 
divergences, so we must examine the corresponding differentiations applied 
to the counterterm graphs (b)—(h). We must regard the derivatives as acting 
on these graphs before divergences are computed (by the operation 
symbolized by the box). 

The differentiation makes the subgraphs y, and y, completely finite, by 
removing both their overall divergence and y,’s only subdivergence. Hence 


<D- 


Fig. 5.4.4. One of the terms obtained by differentiating Fig. 5.4.1 three times with 
respect to its external momentum. 


5.5 Forest formula 109 


the counterterm graphs (b), (d), (e), and (h) are zero after differentiation. This 
leaves graphs (c), (f), and (g). These cancel the subdivergences of (a) coming 
from the subgraphs y, and y,, which are unaffected by the differentiation. 

We may examine the other nine terms in 0°R(G)/dp* similarly, and we 
find that in fact 0°R(G)/dp?° is finite, as claimed. 


5.5 Forest formula 


5.5.1 Formula 


Zimmermann (1969, 1970) gave an explicit solution of the recursive 
definition of the renormalized value R(G) of a graph G. The general idea can 
be gleaned from the example we examined in the previous section. There the 
recursion generated a series of sixteen terms. One was the original graph, 
and the others had the subtraction operation T applied one or more times. 
For example, in graph (e) we first replace the left-most loop y, by T(y,), with 
a result we can write as T, (G). We then take the subgraph equivalent to y3, 
viz. T, (y3), and replace it by the result of acting with T. This gives graph (e). 
The sum of the two graphs (d) and (e) is used as the subtraction for the 
subdivergences of G associated with y4. 

Each of the sixteen terms is pictured as the original graph with some 
number of connected 1PI subgraphs enclosed in boxes to indicate 
application of T to the subgraph. Each term can be specified by giving its set 
of boxed subgraphs. Each such set is called a forest. The subgraphs which 
form a particular forest are either disjoint or nested : they are said to be non- 
overlapping. The set of all possible forests for G is called Z(G). 

There are sixteen forests occurring in Fig. 5.4.3. The first eight are (in 
set theory notation): 


(a) the empty set Ø, 
(b) ty} 

(c) {72} 

(d) {73} 

(e) {Yi V3} 

(A) {ya} 

(g) ail: 

(h) {1V2}. 


These do not contain the whole graph; they are called the normal forests. 
The other eight forests in Fig. 5.4.3 consist of one of the above eight forests 
to which is added as another element the complete graph G. A forest of G 
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containing G 1s called a full forest. The distinction between normal and full 
forests is that the normal forests subtract off the subdivergences, and the full 
forests combine to subtract the overall divergence. 

Not all forests occur in Fig. 5.4.3; for example, the forest 


U = {subgraph consisting of lines carrying loop momentum /} 


does not appear. Such forests contain at least one overall convergent 
subgraph as an element. 

Inspection of Fig. 5.4.3 shows that 

RG)= > [](—T,G. (5.5.1) 

Uc F(G) yeU 

Here the sum is over all forests U of G. The operator T, replaces y by T(y). 
Note that for nested y’s the T,’s should be applied inside to outside. 
Equation (5.5.1) is called the forest formula; it is due to Zimmermann 
(1969). Suppose we compute R(G) for an arbitrary graph G by using (5.5.1). 
Then, as we will prove shortly, the result is the same as if we used the 
recursive definition of R(G) given in Section 5.3. 

It is convenient to let the sum over forests be over all forests rather than 
only over those consisting of subgraphs that are superficially divergent; the 
extra forests give a zero contribution. The reason for doing this is that we 
will sometimes wish to change the definition of T so that we make 
subtractions for some convergent graphs, as well as for divergent graphs. 
For example, such a redefinition will be the key to proving the operator 
product expansion in Chapter 10. 

We now have both a recursive and a non-recursive definition for the 
renormalization of a Feynman graph. It will prove very useful to have both 
definitions available. Different proofs will need different forms of the 
definition. In particular, proofs by induction on the number of loops of a 
graph will naturally use the recursive definition. 


5.5.2 Proof 


The proof of the forest formula is elementary, but somewhat involved. We 
first use (5.5.1), and the following equations: 


RO= YJ] (-T)G: (5.5.2) 
Uc F(G) yeU 
STe RG. if G is 1PI, 
C(G) =« [| [CLC], if G is a disjoint union of 1PI ,’s,$ (5.5.3) 


0 otherwise, 
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as definitions of R(G), R(G), and C(G). Here F(G) is the set of normal forests 
of G (1.e., those that do not contain G). These definitions are correct for the 
graph Fig. 5.4.1, as can be seen by inspection of Fig. 5.4.3. 

Since the recursive definitions uniquely give R(G), R(G), and C(G) in 
terms of the operation T,, it suffices to show that (5.5.1)—(5.5.3) satisfy the 
recursion relations (5.3.6)-(5.3.9). Notice first that (5.3.6) and (5.3.9) are 
really the same, except for being applied to different graphs. 

If R(G) given by (5.5.2) is correct, and if subgraphs are correctly 
renormalized, then (5.5.3) is equivalent to our original definition (5.3.6) of 
C(G). Moreover, suppose that G is connected and one-particle-irreducible. 
Now each forest of G is either a normal forest, that is, a forest of which G is 
not an element, or it is a normal forest to which is adjoined G. Then the 
formula (5.3.7) for R(G) is a direct consequence of (5.5.1)—(5.5.3) for such a 
graph. If Gis not a union of 1PI graphs, then there is no overall divergence, 
and again (5.3.7) holds. 

So it remains to prove the following: 


(1) R(G) is correct when G is a disjoint union of more than one 1PI graph. 
(Note that this case occurs in renormalizing the graph of Fig. 3.2.1(b), as 
we saw in Section 5.3.) 

(2) R(G) is correct, i.e., it satisfies (5.3.8), for a general graph. 


If G is a disjoint union of 1PI graphs y,, then each forest is a union of 
forests, one for each component. Then R(G) = | | R(y,), as we should expect. 


The problem is that this is not manifestly true in the recursive definition, 
where we make an overall subtraction for G. We dealt with this problem 
between (5.3.11) and (5.3.17). 

Our proof of (5.3.8) is by induction on the size of G. Now a one-loop 1 PI 
graph has no non-trivial subgraphs, so its only normal forest is the empty 
set. Then formula (5.5.2) collapses to R(G) = U(G), just as it should. This 
enables us to start the induction. 

It remains to prove (5.3.8b). For this, observe that each forest U has a 
unique set of biggest subgraphs M,,...,.M,;. Each M; is contained in no 
bigger subgraph in U, and each ye U is contained in some M,. The existence 
and uniqueness of this set of M’s is seen by considering pairs y,, y, of 
elements of U. Since y; and y, are non-overlapping, there are three 
possibilities: 

(1) Yi © Y, in which case remove y, from further consideration. 
(2) Yk € Yp in which case remove y, from further consideration. 
(3) y;0 Yı = Ø, in which case leave both in. 
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Repeat until no further eliminations are possible; then the result is the set of 
M,’s. 

The forest U is the union of a full forest, one for each M,. We can write our 
definition (5.5.2) of R(G) as 


R(G)=G+t STC Tw) Do 
Mis., Mn Li=1 U;e #(M:1) Une FMn) 
«| I] -Toe l (-1,) le} (5.5.4) 
yıEUı YnEUn 


Here the first term comes from the case in (5.5.2) that U = @, and the sum in 
the second term is over non-empty sets of disjoint 1PI graphs M, excepting 
the case that M, = G. By setting y= M,v...UM, and using (5.5.3) to 
determine C(M, U...UM,), we find (5.3.8b). 


5.6 Relation to Z 


We have seen how to renormalize an individual Feynman graph by making 
a series of subtractions. The motivation for doing this came from 
consideration of examples in which the subtractions were generated by 
counterterms in the interaction Lagrangian. We will now show that this is 
true to all orders. We will assume the natural result (to be proved later) that 
the polynomial degree of the overall counterterm of a graph is given by its 
degree of divergence, just as for low-order graphs. 

First, we must make precise the result that we will prove. For each 1 PI 
graph G, we have constructed its overall counterterm C(G). Since this is a 
polynomial in the external momenta of G, it can be written as the vertex 
derived from an interaction term D(G)/N(G) in the Lagrangian Z. Here 
N(G) is asymmetry factor of the same sort as the 3! that appears with the ¢° 
interaction term in Z. Each power of a momentum entering D(G) 
corresponds to i times a derivative of the corresponding field. If G is an n- 
point graph and each of its external lines corresponds to the same type of 
field, then N(G) is n!. If there are a number of different fields and n; is the 
number of lines of type i entering G then 


N(G)= [] 7}! (5.6.1) 


For each graph for a Green’s function, the forest formula gives a set of 
graphs with counterterms. We will demonstrate that the set of counterterm 
graphs is generated from the counterterm vertices in the interaction 
Lagrangian. 

Consider ° theory. As before, we write the Lagrangian as: 


L=Ly +L, +} Lu=LotL&,. (5.6.2) 
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The free Lagrangian 
L = (0b)*/2 — m*?/2 (5.6.3) 


generates the propagator, while the interaction X, consists of two terms, 
L, and Za The basic interaction is 


Ly = —gp 4p? /33, (5.6.4) 


and Za is the counterterm Lagrangian used to cancel the ultra-violet 
divergences: 


L „= } D(G)/N(G). (5.6.5) 
G 


Here the sum is over all 1 PI graphs. Those that have no overall divergence 
generate no counterterm; for these D(G) = 0. Each 1PI graph that has an 
overall divergence generates a term in (5.6.5). The formulae (5.6.2) and 
(5.6.5) apply in any theory. 

Since (5.6.5) applies to any theory, it applies in particular to ¢° in higher 
than six dimensions. Thus it enables us to renormalize a non- 
renormalizable theory. But the sum must include counterterms D(G) with 
an arbitrarily large number of powers of momentum and with an arbitrarily 
large number of external lines for G. It is only in six or fewer dimensions that 
the counterterms have the same form as terms in the basic ¢* Lagrangian 
Di Legs 

Now that we have developed a convenient notation, the most difficult 
part of the proof is to ensure that the combinatorial factors come out right. 
We will prove that the Lagrangian defined by (5.6.2) and (5.6.5) gives the 
same renormalized Green’s functions as those generated by our recursive 
definition in Section 5.3 (and therefore the identically same Green’s 
functions as given in Section 5.5 by the forest formula). The proof will be 
given for °? theory in six or fewer dimensions, but it easily generalizes. 

Consider the full N-point Green’s function Gy at order g”. It is sufficient 
to work only with connected graphs. If the theory is renormalizable (as we 
will prove in Section 5.7), then the sum of counterterms has the form: 


P= 5Z(0b)?/2 — ôm?¢?/2 — 693/31. (5.6.6) 
with (by (5.6.5)) 
—6Z= ) [Coefficient of — ip? in C(G)], 


2-point G 
—dm*>= Ý [Coefficient of ip° in C(G)], 
2-pointG 
-~ig= Y CO). (5.6.7) 
3-point G 


We ignore the tadpoles, yet again. The term of order g” in the perturbation 
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expansion of Gy has vertices generated by the different terms in Z, + La 
There will be graphs with all of their vertices being the basic interaction Y,,. 
Let the set of these be called B. The other graphs will contain one or more of 
the counterterm vertices generated by (5.6.6). Each counterterm can then be 
decomposed into a sum of terms by applying (5.6.7) at each counterterm 
vertex. Each term has each of the counterterm vertices replaced by the 
overall divergence of some graph. Then in the result, each term T 
corresponds to a unique basic graph b(T)eB. 
So we have 


Gy=¥ (< + > r) (5.6.8) 
b(T)=G 


G 


On the other hand, we have constructed the renormalization of each of the 
graphs G by writing 


RG)=G+ Y C/G). (5.6.9) 


Each of the terms T in (5.6.8) is constructed by replacing each of a set of one 
or more disjoint 1PI subgraphs y,,...,y; by its counterterm given by 
iD,,(G). On identifying y in (5.6.9) with y, Uy."**Uy,;, we expect that 


G, = $ R(G). (5.6.10) 
G 


This result would be obvious, were it not that the symmetry factors do not 
manifestly match up. 

The problem is illustrated by Fig. 5.6.1. There the basic graph is (a), and 
the complete set of subtractions needed to renormalize it consists of (b)-(e). 
Now the symmetry factor for (a) is 1/8: There is a factor 1/2 for each self- 
energy graph and an overall 1/2 for the top—bottom symmetry of the whole 
graph. Each of the subtractions (b) and (c) has a symmetry factor 1/4, since 
the remaining factor 1/2 goes into the counterterm for the self-energy. Both 
graphs (b) and (c) are equal. 


(c) (d) 
Fig. 5.6.1. Renormalization of a graph, to illustrate symmetry factors. 
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Considered as Feynman graphs, these are the same graph, but with 
symmetry factor 1/2. So they give one term with factor 1/2 in (5.6.8) (derived 
from the Lagrangian), while in (5.6.9) (from the recursion formula) there 
are two equal terms with factor 1/4. The end result is the same. We must 
consider (b) and (c) as distinct graphs when defining R(G), since they 
correspond to different regions of loop-momentum space — we must take 
each momentum variable to be distinguishable. 

To construct a general proof is tedious. The symmetry factor of a graph G 
is 1/N(G), where N(G) is the dimension of the graph’s symmetry group. So 
we write 

G =G/N(G), (5.6.11) 
where the overbar indicates computation ignoring all symmetry factors. 
Similarly we define C(G) by 

C(G) = C(G)/N(G). (5.6.12) 
Now the renormalized value of a graph G is 


R(G)=G+ ¥ CLG) 


ySG 
G 
ANAS 
—_ G+ y {+ — || —_c@ |. 5.6.13) 
N(G) “ft iy 
In the last line we have observed that y is a disjoint union of 1PI graphs y,, 
)>,.... Moreover, we have explicitly indicated the symmetry factors 1/N; = 


1/N(y,) for each y; which is replaced by its overall counterterm. For a 
given subgraph y = Uy, the symmetry groups of the y,’s are a commuting set 
of subgroups of the symmetry group of G. Therefore the quantity 
N(G)/| | N; must be an integer. 


Next, consider the Green’s functions generated by the Lagrangian (5.6.2), 
as in (5.6.8), 


a= Theo Ea CO (5.6.14) 


Here we have observed that each graph containing one or more counter- 
terms is generated from a basic graph by replacing some 1PI subgraphs 
Y1» -> Y; by counterterms. We write y as the union of the y,’s. Then we let G/y 
be the graph resulting from substituting counterterms for the y,’s. By the 
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definition of the counterterm Lagrangian, the result is the same as C,(G), 
aside from symmetry factors. The prime on the )”’ indicates that only y’s 
giving distinct Feynman graphs are considered. (Thus, for example, 
Figs. 5.6.1(b) and (c) are not counted separately.) Thus, if we define 


K(G, y) = [number of graphs y’ for which G/y = G/y’], 
then we must prove that 
N(G) 


LT N, |EMGin) 


It is easiest to couch this final step in the language of group theory. The 
denominator of the right-hand side of (5.6.15) is the product of dimensions 
of commuting subgroups of the symmetry group of G. (Note that, for 
example, two p° counterterms generated by different self-energy subgraphs 
are counted as different.) These subgraphs generate another subgroup, of 
which the set of cosets in the symmetry group of G has exactly the 
dimension of the right-hand side of (5.6.15). But, concretely, each coset 
corresponds to one of the graphs counted by K(G, y). 


K(G,y) = (5.6.15) 


5.7 Renormalizability 


5.7.1 Renormalizability and non-renormalizability 


In this section we explain the properties of renormalizability, non- 
renormalizability, and super-renormalizability of a field theory. We do this 
first for every order of perturbation theory, and then we consider to what 
extent the properties are true beyond perturbation theory, for the complete 
theory. The method in perturbation theory is power-counting and dimen- 
sional analysis. 

Consider first 6° theory in a space-time of integer dimension dọ. We have 
seen how to renormalize it to get finite Green’s functions by adding 
counterterms (5.6.5) to the Lagrangian. Each counterterm is a polynomial 
in the field ¢ and its derivatives. The theory is called renormalizable if the 
only counterterms needed are proportional to the terms (0¢)’, 6”, and ¢° 
present in the original Lagrangian Z, + ,. This is equivalent to saying 
that the Lagrangian has the form 


L = (0)? /2 — mog5/2 — Jo¥o/3!, (5.7.1) 


where the bare field ¢, is Z'/?. The bare mass mo, the bare coupling gy, and 
the field-strength renormalization Z each have singular behavior as the 
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ultra-violet regulator is removed. A linear term dh@ is needed as well. We 
may regard it as being present in the original Lagrangian. In any event it is 
only a single extra coupling. It can be ignored if we impose the 
renormalization condition that ¢0|¢|0> =0 to determine 6h, and if we 
ignore tadpole graphs. 

We generalize to an arbitrary theory by calling a theory renormalizable if 
the Green’s functions of its elementary fields can be made finite by rescaling 
the fields (in a cut-off dependent way) and by making some suitable cut-off 
dependent change in the couplings and masses. 

In perturbation theory we determine whether or not we have re- 
normalizability by examining the possible values of the degree of diver- 
gence 0(G) for the 1 PI graphs. For every graph G acounterterm is needed if 
0(G) >0. As we will prove in Section 5.8 the counterterm C(G) is a 
polynomial of degree 6(G) in the external momenta, and provided we use a 
scheme like dimensional regularization that preserves Poincare invariance, 
the counterterms are Poincare invariant. 

Let us now determine whether or not ¢° theory in d space-time 
dimensions is renormalizable. In d space-time dimensions the N-point 1PI 
graphs have dimension (in momentum space) 


d(Gy) = N +d — Nd/2. 

Then (by (3.3.12)) the degree of divergence of a graph for Gy at order g” is 

ô(Gy) =d + (1 — d/2)N + (d/2 — 3)P. (5.7.2) 

Note that the minimum value of P to have a one-loop connected graph 
is N: 

Inspection of (5.7.2) shows that if d > 6 then, for any value of N, there can 

be made N-point graphs with arbitrarily high degree of divergence by going 

to large enough order in g. The theory is therefore not renormalizable if 


d > 6, and the non-renormalizability is a direct consequence of the negative 
dimension of g. 


and tadpole graphs to 4 loops 


Fig. 5.7.1. All the graphs with overall divergences in ¢° theory at those space-time 
dimensions where it is super-renormalizable. 
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If d = 6, only the one-, two-, and three-point functions are divergent, with 
degree of divergence 4, 2, and 0, respectively. The permissible counterterms 
are just terms of the form of those in Z+ %,, so the theory is 
renormalizable if d = 6. Moreover, there is a divergence in every order of g 
(except for tree graphs, of course). 

If d = 3, 4, or 5, then only a finite set of graphs, illustrated in Fig. 5.7.1, 
have overall divergences, and renormalization is needed only for the mass 
and for the tadpole coupling. Again we have renormalizability. 


5.7.2 Cosmological term 


Strictly speaking, we should also consider Feynman graphs with no 
external lines. These are the vacuum bubbles. They generate the energy 
density of the vacuum, and normally are ignored. But in gravitational 
physics, they cannot be ignored. Counterterms for such graphs (present in 
p? theory whenever d > 2) are proportional to the unit operator. They area 
renormalization of what in General Relativity is the cosmological constant. 
A counterterm is even needed for free-field theory — where the divergence is 
conventionally removed by normal-ordering (see, for example, Bjorken & 
Drell (1966)). We see that normal-ordering is nothing but a primitive form 
of renormalization. 


5.7.3 Degrees of renormalizability 
It is convenient to distinguish three types of renormalizable theory: 


(1) Finite: no counterterms needed at all. 

(2) Super-renormalizable: only a finite set of graphs need overall counter- 
terms. 

(3) Strictly renormalizable: infinitely many graphs need overall counter- 
terms. (But note that they only renormalize a finite set of terms in the 
basic Lagrangian, since we assumed renormalizability of the theory.) 


Finiteness or super-renormalizability normally occur when all the coup- 
lings in the basic Lagrangian have positive dimension. 

Note that in a super-renormalizable theory, the number of divergent 
basic graphs is infinite. For example, even if there is only one graph y with 
an overall divergence, any graph containing y as a subgraph is divergent. 
However all such graphs become finite after adding to y its counterterm, so 
only one counterterm, C(y), appears in the Lagrangian. 

Mathematical physicists (see Glimm & Jaffe (1981)) have investigated 
renormalizability beyond perturbation theory. This is important, since 
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perturbation series are in general asymptotic series rather than convergent 
series. Thus one cannot simply sum the perturbation series to obtain the 
complete theory. Even so, it has been proved for many super- 
renormalizable theories that perturbation theory gives an exactly correct 
account of the divergences. (A much investigated case is ¢* theory in two 
and in three space-time dimensions.) 

In asuper-renormalizable theory, the series for a bare mass or coupling in 
terms of the renormalizable quantities has a finite number of terms. 
Therefore the series converges, and one only has to prove that (a) 
perturbation theory is asymptotic to the true theory, and (b) there are no 
terms like exp( — 1/g) in the bare masses or couplings that are smaller than 
any power of g. The rigorous proof amounts to showing that in summing 
the perturbation series to a finite order, the error is correctly estimated by 
the first term omitted. In particular, this applies to the existence of any 
possible ultra-violet divergence. 

Rigorous proofs are not yet available for any strictly renormalizable 
theory. One difficulty is obvious: the series for, say, the bare coupling, g, is 
an infinite series, each term of which diverges as the UV cut-off is removed. 
Since the series is presumably asymptotic rather than convergent, one 
cannot directly obtain any information about renormalization in the full 
theory: the error obtained in using a truncated form of the series is of the 
order of the first term omitted, and that is always divergent. 

It might even appear that perturbation theory has no light at all to shed 
on the question of renormalizability of the full theory. This is in fact not so, 
as we will see when we discuss the renormalization group in Chapter 7. If 
the theory has the property called asymptotic freedom then a series of 
suitable redefinitions of g allows short-distance phenomena to be computed 
reliably. In particular the UV divergences can be computed in terms of 
weak coupling series without divergent coefficients. It is sensible to 
conjecture that a suitably refined analysis can be made to obtain rigorous 
bounds of the errors so that the perturbative results correctly give the 
divergences. Monte-Carlo studies of the functional integral (Creutz & 
Moriarty (1982)) support this conjecture. In four dimensions, only certain 
non-abelian gauge theories (including QCD) are asymptotically free (Gross 
(1976)). 

We will also see in Chapter 7 that in non-asymptotically free theories, 
like * and QED in four dimensions, perturbation theory cannot reliably 
describe short-distance phenomena. There are, in fact, indications 
(Symanzik (1982)) that the full t theory is not renormalizable, contrary to 
the situation order-by-order in perturbation theory. 
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5.7.4 Non-renormalizability 


For theories which are not renormalizable in perturbation theory, there are 
many possibilities. Among them are the following: 


(1) There is only a finite set of 1PI Green’s functions which have overall 
divergences. A typical case is °? theory in six or fewer space-time 
dimensions when the basic Lagrangian, 

L =(0)*/2 — m’*o?/2 — g¢?/3!, (5.7.3) 
has no term linear in @. The one-, two-, and three-point functions have 
divergences, but there is no term hd whose coupling can be re- 
normalized to cancel the divergence of the tadpole graphs. However, 
addition of such a term generates a renormalizable theory. More 
generally, suppose we have a finite set of overall-divergent Green’s 
functions. A renormalizable theory is generated by adding a finite set of 
extra interactions. 

(2) There is an infinite set of Green’s functions with overall divergences. 
However, for all but a finite set of the Green’s functions, the divergences 
cancel after summing over all graphs of a given order. (There are no 
known cases of this.) 

(3) As for case 2, except that the divergences cancel only for the S-matrix, 
rather than for all off-shell Green’s functions. An important case is a 
spontaneously broken gauge theory, when it is quantized in its unitary 
gauge. : 

(4) The theory is made renormalizable by going beyond perturbation 
theory in some systematic and sensible way. One case (as in the Gross— 
Neveu (1974) model — see Gross (1976)) is of a theory that is strictly 
renormalizable and asymptotically free for some dimension d = dy, and 
that is considered in some dimension d slightly greater than dp. 

(5) Asfor case 1, except that the extra terms make physical nonsense. A case 
is the Yang-Mills theory with a mass term in Feynman gauge. Then the 
extra terms destroy unitarity (t Hooft (1971a)). 

(6) None of the above. 


Roughly speaking, there are no general rules. Each case must be handled 
separately. Only for the last two cases (5 and 6) should a theory be called 
non-renormalizable. A fundamental theory should be renormalizable, for 
otherwise either physical quantities are actually infinite or they are finite, 
but an infinite set of parameters is needed to specify the finite parts of the 
counterterms. 

Nevertheless, a statement that a particular theory 1s non-renormalizable 
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is really a statement of ignorance: nobody has found a way to construct a 
physically sensible version of the theory. (Cases 1 to 5 are where somebody 
has found a way.) In practice, when a theory is labelled non-renormalizable, 
what is usually meant is that the theory is not renormalizable order-by- 
order in perturbation theory; such a statement can be proved by calculating 
a finite number of graphs. 

Within the usual functional-integral approach (with a lattice cut-off), not 
only has the complete ¢* theory been proved renormalizable for d = 2 and 
3, but it has been proved non-renormalizable for d > 4 (Aizenman (1981)). 


5.7.5 Relation of renormalizability to dimension of coupling 


To prove perturbative renormalizability of a theory of scalar fields, we 
generalize the argument leading to (5.7.2). The argument will apply when no 
coupling has negative dimension. Renormalizability will hold with possibly 
the addition of extra interactions (like the hg term in ¢° theory) whose 
coefficients have non-negative dimension. Our proof will easily generalize 
to theories with fermion and gauge fields. The problems we will encounter 
in gauge theories will all be to do with the question of whether these extra 
terms are compatible with the gauge invariance. But we will leave these 
questions to Chapter 12. | 
Let a general term in Z or Z be written schematically as 


(coupling {)(derivative)‘ (field)*. (5.7.4) 
The vertex generated by this term is one possible graph for the 1 PI Green’s 
function I, with N external lines. Thus the dimension of I’, satisfies 
d(T y) =d(f) + A. (5.7.5) 
Since no coupling has negative dimension, the degree of divergence of any 
graph for I’, is at most d(T y), as we saw from examples in Section 3.3.3, and 
as we will prove in Section 5.8. That is, the degree of divergence 6(I-y) 
satisfies 


(Py) < dy), (5.7.6) 


with equality only for graphs all of whose couplings have zero dimension. 
To renormalize the N-point graphs, we add counterterms of the form 
(5.7.4) with at most 6([,) derivatives. So the possible counterterms satisfy 


d(f) = dE) — A >d('y) — A 20. (5.7.7) 


The last inequality follows since a counterterm with A derivatives is needed 
only if the degree of divergence is at least A. From (5.7.7) it follows that we 
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need no couplings of negative dimension, given that none are present in the 
original Lagrangian. 

Some of the couplings generated as counterterms may not be present in 
the original Lagrangian even if it contains no couplings of negative 
dimension. But the number of new couplings needed is nevertheless finite, 
because only a finite set of counterterms satisfy (5.7.7). 


5.7.6 Non-renormalizable theories of physics 


From the discussion above, it is natural to conclude that a theory of physics 
should be renormalizable. In fact, the strong, electromagnetic, and weak 
interactions appear to be described by a renormalizable theory. This theory 
is a combination of quantum chromodynamics for strong interactions and 
the Weinberg—Salam theory for weak interactions. 

Around 1970 there was a revolution in the theory of weak interactions 
when it was discovered that non-abelian gauge theories are renormalizable. 
It is precisely one of these theories that was found to be necessary to 
construct a renormalizable theory of weak interactions in agreement with 
experiment. See Beg & Sirlin (1982) for a historical review. 

Unfortunately, this progress has not extended to gravity. Einstein’s 
theory of general relativity is non-renormalizable, after quantization, and 
there is no very promising alternative. (This situation exists despite many 
significant attempts to improve it - Hawking & Israel (1979).) 

It is a mistake to suppose that non-renormalizable theories should be 
banished from consideration. Remember that for many years weak 
interactions were successfully calculated using the four-fermion’ theory, 
which is non-renormalizable. For most purposes, weak interactions could 
be adequately treated in the lowest order of perturbation theory, where no 
renormalization is needed. But the non-renormalizability of higher-order 
calculations raised the question of consistency of the theory: is it legitimate 
to calculate even an approximation to a nonsensical (i.e., non-existent) 
theory? Will the results of calculations mean anything? The same questions 
arise in gravity. There, the classical theory of general relativity is very 
successful, but the quantized theory is badly non-renormalizable. 

We must therefore understand how and why we may use non- 
renormalizable theories in physics. 

Now, to perform consistent calculations in any theory which contains 
ultra-violet divergences, we must impose an ultra-violet cut-off, M, of some 
sort. In the case of a renormalizable theory we can take M to infinity and 
obtain finite results that are insensitive to the cut-off. Another related 
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property of a renormalizable theory is the decoupling theorem of 
Appelquist and Carazzone, which we will discuss in Chapter 8. This 
theorem applies to a renormalizable theory which contains fields whose 
masses are much bigger than the energies of the scattering processes under 
consideration. The theorem states that the heavy fields can be deleted with 
only a small effect (suppressed by a power of the heavy mass) on cross- 
sections, etc. The hallmark of a renormalizable theory is in fact that it is 
complete in itself. It contains no direct indications of whether it is only part 
of a larger and more complete theory. 

These statements are false for a non-renormalizable theory. Consider the 
old four-fermion theory of weak interactions. Its coupling is G ~ 107° 
GeV”. We cannot take the UV cut-off arbitrarily large, for an nth order 
graph has a divergence of order 2n; it behaves like (M?G)" for large cut-off 
M. Counterterms to make the graph finite need a correspondingly large 
number of derivatives, but only a finite number of counterterms are 
available. Hence we cannot take the cut-off to infinity, and if we want 
insensitivity to the cut-off we must take M $ G- '/?. Moreover, the energy, 
E, of the process under consideration must be much less than M, otherwise 
the calculation is dominated by details of the cut-off procedure. In other 
words, the four-fermion interaction is a good approximation to physics 
only if E <M < G- °. The minimum possible relative error of calculations 
is of the order of the maximum of M?G and E?/M?. 

Now, it is always possible in principle to do experiments at arbitrarily 
high energy. So the applicability of four-fermion theory at low energies 
implies that at energies rather below G7 +% ~ 300 GeV there is new physics. 
That is, the four-fermion theory becomes incorrect at that energy. The last 
fifteen years of weak interaction physics confirms this. (See, for example, 
Bjorken (1982)) 

For gravity, the corresponding energy scale is the Planck mass, of the 
order of 10'? GeV. This is extremely far beyond the range of normal 
accelerator experiments. Evidence for phenomena on such an energy scale 
must come from much more esoteric observations. Examples might be 
found in certain areas of the cosmology of the early universe, or from seeing 
the decay of a proton (Langacker (1981)). 

In any case, a non-renormalizable theory contains indications that it 
cannot describe all phenomena. It contains the seeds of its own destruction 
as a viable theory of a field of physics. So, given a successful non- 
renormalizable theory, one must ask the following questions: (1) ‘Of which 
more complete theory is it a part?’ (2) ‘How is it related to that theory?’ An 
example is given by the relation between the Weinberg—Salam theory, 
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>X 
Fig. 5.7.2. W-boson exchange gives an effective four-point interaction at low 
energies. 


which is renormalizable, and the four-fermion theory, which is not. The 
four-fermion theory arises as an approximation to W-boson exchange at 
low energy (Fig. 5.7.2). One replaces the propagator 


i/(q? — my), 
for the W-boson, by 
i/(— my). 
The graph is suppressed by factor of at least E?/m?, compared to photon 
exchange. It gives an example of the decoupling theorem: the heavy 
particles have small effects at low energies. 

The only reason we can see such effects is the high degree of symmetry of 
the strong and electromagnetic interactions. These interactions conserve P, 
C, T, and the number of each flavor of quark and of each flavor of lepton. 
Weak-interaction amplitudes are much smaller than strong-interaction or 
electromagnetic amplitudes for similar processes, and are therefore nor- 
mally invisible. But there are many processes that are completely forbidden 
in the absence of weak interactions; for these, any weak-interaction cross- 
section, no matter how small, is all there is. 

So one important way in which a non-renormalizable theory arises is as a 
low-energy approximation to a renormalizable theory in a process that is 
forbidden in the absence of the heavy fields. The heavy fields effectively give 
a cut-off on the non-renormalizable theory. Then, for example, the four- 
fermion coupling G is computable in terms of the underlying theory via a 
formula like 


G = constant g*/mj;, + higher order corrections in g. 


Here g is the dimensionsless coupling of the Weinberg—Salam theory. One 
manifest characteristic of this non-renormalizable theory is the weakness of 
its interactions. Also note that a higher power of Gis a higher inverse power 
of m;,. We are taking the leading power of my, as mẹ gets large, so it is in 
general incorrect to calculate in the non-renormalizable theory beyond 
lowest order. Higher-order calculations must be done in the full theory. 
A slightly different situation arises in gravity. There one must perform 
calculations beyond tree approximation, since gravitationally bound states 
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like the solar system are formed by multiple exchange of gravitons. 
Counterterms are generated involving higher-derivative interactions (e.g., 
R’, R?,. etc.). The ambiguity in the finite parts of these counterterms gives 
an uncertainty in the Green’s functions. However the uncertainty is a power 
of momentum divided by some large mass scale, and is negligible for low- 
momentum-transfer processes. In weak interactions, the size of the higher- 
order corrections is of the same order of magnitude as the intrinsic error in 
the calculations, but in gravity this is not so because of the zero mass of the 
graviton. 

Another difference is that gravity is actually the strongest of the four 
fundamental interactions when considered on a large enough scale. In 
contrast, on atomic or molecular scales, it is the other three interactions 
that are by far the strongest. However, the strong and weak interactions 
have a finite range, so that they are essentially zero outside the nucleus. 
Particles can have both signs of electric charge, so that bulk matter, if 
charged, tends to attract charge of the opposite sign to it. Bulk matter is 
therefore generally neutral. But gravity couples to mass or energy, so it is 
always attractive. Hence gravity wins out as the strongest interaction for 
large enough assemblages of matter. However at nuclear and atomic scales, 
it is negligible by a factor of about 10*° compared to the other interactions. 

Let us summarize by restating the key conclusion about the distinction 
between renormalizable and non-renormalizable theories. A non- 
renormalizable theory considered at low energy gives some indications that 
at high enough energies it must break down, and cannot be a complete 
theory. A renormalizable theory gives no such indication. 


5.8 Proof of locality of counterterms; Weinberg’s theorem 


In our examples, we saw that the counterterm C(G) of a graph G is a 
polynomial in its external momenta, of degree equal to its overall degree of 
divergence 6(G). This is a general property, as we will now prove. 

The original proof of this theorem and some related results is due to 
Weinberg (1960); a simpler proof was given by Hahn & Zimmermann 
(1968). It is useful to distinguish three results: 


(1) Suppose that a 1PI graph G and all its 1PI subgraphs have negative 
degree of divergence. Then the graph is finite. That the degrees of 
divergence of the graph and subgraphs are negative means that there is 
no divergence when all or some of the loop momenta go to infinity 
together, with the other momenta finite. The problem is to eliminate the 
possibility of a divergence from more exotic scalings. 
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(2) Suppose that a 1 PI graph G has negative degree of divergence, but that 
it might have subdivergences. Then the graph is finite if we first subtract 
off subdivergences. More simply, if 6(G) < 0 then R(G) is finite. 

(3) If a 1PI graph G has degree of divergence 6(G), then its overall 
counterterm C(G) is polynomial in the external momenta of G of degree 
6(G). 


Property (1) is a trivial case of (2). We will reduce (3) to (2) by the same 
differentiation method as we used in Section 5.2.2. The proofs will be by 
induction. This naturally suggests that we use the recursive definition of the 
renormalization R(G) of G. 

One generalization is useful. It is that the renormalization prescription 
may be chosen so that result (3) reads (t Hooft (1973), Weinberg (1973), and 
Collins (1974)): 


(3’) If a 1PI graph G has degree of divergence 6(G), then its overall 
counterterm C(G) is polynomial in the external momenta of G and in the 
massive parameters in the Lagrangian. (The parameters in question are 
the masses of fermions and the squared masses of bosons.) The 
dimensions of the terms in the polynomial are at most 6(G). 


5.8.1 Degree of counterterms equals degree of divergence 


We first prove Property (3), that the overall counterterm C(G) of a 1PI 
graph is polynomial in the external momenta of degree 6(G). We will do this 
assuming Property (2), that a graph with its subdivergences subtracted is 
finite if its degree of divergence is negative. Let G be a 1PI graph with degree 
of divergence 6(G) > 0. We will consider R(G), which is G plus counterterms 
for its subdivergences. Following Caswell & Kennedy (1982), let us 
differentiate the graph 6(G) + 1 times with respect to external momenta. 
This produces a result that has negative degree of divergence. We 
differentiate not only the graph G, but also its various counterterm graphs 
C,(G). The aim is to show that the result is actually convergent. To do this 
we will show that the differentiated counterterm graphs are the correct 
counterterm graphs for the differentiated original graph. This may sound 
obvious, but there are some subtleties, so we will give the details. 


Let the external momenta of G be p,,...,p,. Its renormalized value is 
R(G)=G+ f C/G) 
y SG 
= R(G) + C(G) 


= R(G) — T œ R(G). (5.8.1) 
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Let ô denote differentiation with respect to one of the external momenta, 
and let 0* denote any A-fold differentiation with respect to the external 
momenta. The property to be shown is that 6*R(G) is finite if 2 > 6(G). (It is 
clear from naive power-counting that 6(0*G) = 6(G) — A.) 

Suppose we ensure that differentiation commutes with the basic 
subtraction operator T,. This amounts to imposing a very natural relation 
between the finite parts of, for example, T,(0G) and T,(G). (The relation is 
satisfied by the pole-part subtractions, but it is possible to choose exotic 
renormalization prescriptions not satisfying the hypothesis.) Then for any 
graph y we have 


OC(y) = C(ey). (5.8.2) 
It follows that, for the original graph, we have 


0*R(G) = R(0*G). (5.8.3) 


The point here is that a differentiation when acting on a graph gives a 
number of terms, in each of which one of the propagators or vertices is 
differentiated. It is a simple generalization of the argument given in 
Sections 5.2.2 and 5.2.3 for specific graphs that the counterterms for 
subgraphs of G are the correct ones after differentiation in (5.8.3). 

Now R(07G) is the sum of a graph ôG that has negative degree of 
divergence and the counterterm graphs for its subdivergences. Hence by 
Property (2) it is finite, so that we may choose the subtraction operator T to 
give zero. Therefore the counterterm in (5.8.1) for the undifferentiated graph 
is polynomial of degree 6(G) in the external momenta. 

The same argument (Collins (1974)) also shows that counterterms are 
polynomials in mass. Here it is necessary to note that differentiation with 
respect to m? does not automatically reduce the degree of divergence. This 
only happens if counterterms for subdivergences are polynomial. If a 
counterterm has a piece proportional to ln (m°) then differentiating with 
respect to m? leaves the degree of divergence unchanged. The proof merely 
demonstrates that it is always possible to choose counterterms to be 
polynomial in m?; it is not compulsory. 


5.8.2 R(G) is finite if 6(G) is negative 


It was evident in one-loop examples that a graph with degree of divergence 
ô is renormalized by a local counterterm of degree ô in the external 
momenta. To generalize the result to an arbitrary graph, we constructed a 
renormalization procedure which involved computing the counterterm for 
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a 1PI graph only after subtracting subdivergences. We differentiated the 
graph 6+1 times with respect to its external momenta to prove its 
counterterm to be local and of degree 6. This proof relied on assuming the 
following statement: 


If a graph has negative degree of divergence and has its subdivergences 
subtracted according to the rules, then it is finite. More briefly, if ôT) < 0 
then R(T) is finite. 


This statement sounds extremely plausible. It is nevertheless in need of 
proof. We have to ensure that the subtraction procedure actually accom- 
plishes its purpose of removing the subdivergences. (That is, there are no 
spurious divergences induced by the procedure.) In addition, we normally 
only consider the divergences as arising from regions in which some loop 
momenta go to infinity, all at the same rate; this generates the usual power- 
counting. It is necessary to eliminate more exotic possibilities. 

The most important problem, which is the one we will examine, is to treat 
the case that a collection of loop momenta go to infinity, but at different 
rates. In Section 5.2.2, we examined the special case of Fig. 5.2.1. The 
general case is very similar. Inductively, we assume that properties (1) to (3), 
listed at the beginning of Section 5.8, are true for all smaller graphs than the 
graph G under consideration. We consider regions of the integration over 
_ loop momenta where all or some momenta go to infinity, not necessarily at 
the same rate. We will eliminate them as possible sources of additional 
divergences. If all the momenta go to infinity together, then the negative 
overall degree of divergence means that there is no actual divergence from 
this region. 

If some momenta stay finite while the others go to infinity (not necessarily 
at the same rate), then let y be the subgraph consisting of all those lines with 
the large momenta. Our inductive hypothesis ensures that all the resulting 
divergences are cancelled by counterterms for subgraphs. 

The remaining case is that all of the loop momenta go to infinity, but 
again not at the same rate. Let k denote the components of the smallest 
momenta, and let / denote the rest. (Our notation is meant to copy that used 
for Fig. 5.2.1, and so is the proof.) Let y be the subgraph consisting of all 
those lines carrying the loop momenta l. It may be a single 1 PI graph or a 
disjoint union of 1PI subgraphs. Let these 1PI subgraphs be jy,,...,;. 
Expand each subgraph in powers of its external momenta up to its degree of 
divergence. The remainder for each subgraph is really a graph with negative 
overall degree of divergence; the contribution vanishes as l goes to infinity, 
so we should have set 1 = O(k). The expanded terms contribute just as they 
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did for Fig. 5.2.1. After subtraction of divergences we have a factor in the 
integral over k corresponding to the dimension of the subgraph. 

We gloss over here some of the important details, notably what happens 
to the value of a general subgraph when some of its external momenta get 
large. But the main lines of the argument should be apparent. 

The structure of the proof is the same as in subsection 5.2.2 and the result 
is the same. 


5.8.3 Asymptotic behavior 


Weinberg (1960) not only proved the convergence theorem stated above 
with more complete rigor, but he also investigated what happens when 
several of the external momenta p,, p,,...,p, of a graph y get large in the 
Euclidean region. They are assumed all to be of an order Q, with the ratios 
pi/Q fixed as Q > œ. None of the sums of subsets of p} /Q vanish. Weinberg 
then states how to find the asymptotic behavior: 


(1) Consider any subgraph y connected to all the lines carrying the large 
momenta. Let all the loop momenta of y be of order Q. Compute the 
power of Q: Q%. 

(2) Look at all such subgraphs. Let a, have a maximum value a. 

(a) If there is a unique graph with this maximum power, then I œ Q* as 
Q>. 

(b) If there are several subgraphs with a, = a, then let N be the number of 
such subgraphs. The asymptotic behavior is: 


T = Q[A,By + A,B,InQ+ A,B,(InQ)? +: Ay_, By- (nQ) t] 
+ O(Q°7}). (5.8.4) 


Here the A,’s are functions of those momenta that are fixed as Q > œ 
and the B,’s are functions of the finite quantities p}/Q. 


This theorem is needed inductively in the guts of the convergence theorem 
proved in the last subsection. Its proof is similar. 

It is not obvious that this part of Weinberg’s theorem is of much use for 
physics, other than for its part in this convergence proof, since the 
asymptotic behavior is of Euclidean momenta. However, in the deep- 
inelastic scattering of a lepton on a hadron, there is a photon or a weak 
interaction boson that is far off-shell. The momentum carried by the boson 
is effectively Euclidean, and Weinberg’s theorem applies. We will see this in 
Chapter 14. There are also generalizations to other intrinsically 
Minkowskian situations (e.g. Amati, Petronzio & Veneziano (1978), Ellis et 
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al. (1979), Libby & Sterman (1978), Mueller (1978, 1981), Stirling (1978), and 
Buras (1981)). These are beyond the scope of the present book. 


5.9 Oversubtractions 


We showed how to renormalize a Feynman graph by making subtractions 
for the divergent subgraphs and for the overall divergence of the graph. It is 
possible, however, to make subtractions on graphs that are not divergent. 
Subtractions can also be made with a higher degree polynomial in the 
external momenta than called for by the degree of divergence. Either of 
these cases is called oversubtraction. Now, the general form of the 
renormalization, either by the recursive method or by the forest formula, 
did not specify the exact form of the subtraction operator T. So 
oversubtractions can be made without changing the general formalism. 
There are two important uses for oversubtractions. The first is when we 
wish to use ‘physical values’ of masses or couplings as the renormalized 
parameters. We will discuss this ina moment. The second use is to construct 
Operator product expansions. There, subtractions are made not only to 
cancel UV divergences but also to extract asymptotic behavior as some 
external momenta get large. We will discuss this later in Chapter 10. 


5.9.1 Mass-shell renormalization and oversubtraction 


We have considered renormalization as the procedure of removing 
divergences. Another point of view comes from the observation that one 
cannot observe directly the mass and coupling parameters that appear as 
coefficients in the Lagrangian. For example, consider a theory where each 
field has a corresponding single-particle state. Then the masses that are 
measured are those of the single particles, and it is often sensible to 
parametrize the theory in terms of these masses. Similar remarks can be 
applied to couplings. (Thus in QED one normally parametrizes the theory 
by the electron’s mass and charge, defined by the long-range part of its 
electric field.) It can also be convenient to rescale the fields so that each 
propagator has a pole of unit residue. 

In a simple renormalizable theory like ° in six dimensions the 
renormalizations to accomplish such a mass-shell parametrization are 
precisely those necessary to cancel the UV divergences. Thus we may define 
the subtraction operator applied to a self-energy graph X(p7) to be 


Ton? E(P?) = L(ms,) + (P? — m3,)='(m2,), (5.9.1) 
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so that the inverse propagator satisfies 

— ilp? — m3, — Z4(p7)] = —i(p? — min) + O(p? — m3), (5.9.2) 
as p? + m:,. We use the subscript ‘ph’ to indicate renormalization according 
to the mass-shell scheme. 

Of course, mass-shell renormalization is only one out of many re- 
normalization prescriptions that we may use to cancel UV divergences. But 
we may also choose to renormalize in the absence of divergences. Consider, 
as an example, ¢° theory again, but now in four dimensions. We may 
continue to use (5.9.1) and (5.9.2) for the renormalization of the propagator 
so we have a ‘physical’ parametrization. But all except the one-loop self- 
energy graph have no divergence, so all the wave-function counterterms are 
finite and all but one of the mass counterterms are finite. The combinatorics 
of the renormalization procedure as described earlier all work unchanged. 


5.9.2 Remarks 


One important technical problem is to check that the oversubtracted and 
the normally subtracted theories differ only by a reparametrization. This 
can be done by the methods which we will describe in Sections 7.1 and 7.2. 

In the previous subsection 5.9.1, we took the point of view that 
renormalization is the process of reparametrizing the theory in terms of 
‘physical’ quantities. It should be noted that this is not always a useful point 
of view. In the first place, other renormalization prescriptions are more 
convenient for handling certain types of calculation. In the second place, 
there may be infra-red divergences that make the mass-shell structure of a 
theory not what one would naively expect: thus in QED the electron’s 
propagator does not have a simple pole. And, finally, in some theories there 
are many more particles and couplings than independent parameters. This 
is very common in gauge theories. 


5.9.3 Oversubtraction on IPR graphs 


The aim of oversubtraction, generally, is to impose some condition on 
Green’s functions. So far, we have assumed the condition to be imposed on 
the 1PI graphs, since those are the ones needing counterterms for 
divergences. However, consider ¢° + ¢* theory: 


L =(0¢)?/2 — m?p?/2 —f 63/6 — gb*/24 + counterterms. (5.9.3) 


Let us choose to renormalize at zero external momentum. Thus the self- 
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(a) (b) (c) (d) 
Fig. 5.9.1. Subtraction of one-particle-reducible subgraphs. 


energy È and the three-point 1PI function T, satisfy 
dÈ 
(p* = 0) dp2? ) 


Fa (p? = Pz = p3 = 0) = lowest order = — if. (5.9.4) 


Following from our earlier work we might renormalize the four-point 
function I,) by requiring the sum of the 1PI graphs to be equal to their 
lowest order value at zero external momentum. However it is also sensible 
to impose instead the condition on the amputated four-point function T9). 
These graphs are 1PI only in the four external lines. (We should amputate 
the graphs since the counterterm vertex will have attached to it external 
propagators.) Thus in addition to the three tree graphs of Fig. 5.9.1 (a)-(c), 
we require the counterterm, Fig. 5.9.1(d): 

Our general method of renormalization tells us that whenever we have a 
basic graph containing one of the graphs (a), (b), or (c) in Fig. 5.9.1 as a 
subgraph, there will be counterterm graphs in which this subgraph is 
replaced by the counterterm vertex (d). These counterterm graphs may be 
divergent even when the basic graph is finite. An example is shown in 
Fig. 5.9.2. In Fig. 5.9.2(a) if we impose the renormalization condition on the 
1 PI functions only the graph (b) occurs as counterterm:;(a) plus (b) is finite. If 
we impose the condition on amputated graphs we immediately meet graph 
(c) where the line A is replaced by its 1/3 share of the counterterm Fig. 
5.9.1(d). 

But graph (c) has a divergence, so we must renormalize it by a three-point 
counterterm to the subgraph consisting of the line 4 and the loop B. This is 


F F 
TT 


Fig. 5.9.2. The subtractions of Fig, see inside a bigger graph. 
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shown in graph (d). Note that the graph consisting of line A and loop B has 
a subdivergence, but no overall divergence. Even so, the overall counter- 
term in (d) is divergent. 

It is not difficult to see that the extra counterterms needed to impose the 
renormalization condition on the 1PR amputated graphs do not change 
our results on renormalization. The instructions for renormalization in 
Sections 5.3 and 5.5 can be used provided only that we replace the term ‘1 PI 
subgraph’ by ‘amputated subgraph’. 

The use of subtractions on 1PR graphs is too baroque for normal use. 
However it is a device that is useful for discussing the large mass expansion 
and the operator-product expansion (Chapters 8 and 10). 


5.10 Renormalization without regulators: the BPHZ scheme 


In setting up the renormalization procedure in Sections 5.3 and 5.5 we were 
careful not to use a specific definition of the subtraction operation. This was 
to allow for the choice of one out of the infinitely many possible 
renormalization prescriptions. An obvious one is the mass-shell sub- 
traction procedure indicated in the last section. Another is the minimal 
subtraction procedure to be defined precisely in Section 5.11; we have 
already made much use of it. In this section we will explain the method of 
Zimmermann (1969), in which the subtractions are applied directly to the 
Feynman integrand, so that no regulator need be used. 

The starting point is the method due to Bogoliubov & Parasiuk (1957) 
and Hepp (1966), called the BPH scheme. They observed that the overall 
counterterm for a graph T is a polynomial of degree ôT), its degree of 
divergence. So they defined the subtraction operator T(T) to be the terms 
up to order ô(T) in the Taylor expansion of I about zero external 
momentum. For example, consider the one-loop self-energy graph 
Fig. 3.1.1 in @°* theory in six dimensions. After dimensional regularization 
its unrenormalized value is 


g? 


2, (p?, d) = Nän a an| dx [m? — p*x(1 —x)]*/*~*. (5.10.1) 


The terms up to order p? in its Taylor expansion about p = 0 are 
2 


=g 


ToL = 
a 2(4n 4n)” 


=y [ (2 — d/2) x 


x i dxm~4[1 — (d/2 — 2)p?x(1 — x)/m?]. (5.10.2) 
(0) 
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The renormalized value of the graph is T, — T-T. So at d =6 this is 
2 


pe ae |. dx {[m? — p?x(1 — x) ]In[1 — p?x(1 — x)/m?] + p?x(1 — x)}. 


| (5.10.3) 

Zimmermann’s (1969) achievement was to realize that this construction 
can be applied directly to the integrand. Subdivergences are subtracted with 
the aid of his forest formula. Then the result is an integral which, according 
to power-counting, has no UV divergences. The integral therefore has in 
fact no divergences (Hahn & Zimmermann (1968), Zimmermann (1968)). 
This method is called BPHZ renormalization. 

In Section 3.4, we applied this method to the above graph, with the result 
(3.4.7). It can be explicitly calculated by putting all the terms over a common 
denominator and then using standard parametric methods. The result 
agrees with (5.10.3). 

An example involving a subdivergence is given by Fig. 5.10.1 for #* in 
four dimensions. Let the renormalized integral be 


ig? 
T apyz = sa faase, P2, P3, Pas K, L). (5.10.4) 
P3 
Pi k+ ps 
k+l 
P2 k— Pe Pa 


Fig. 5.10.1. A two-loop vertex graph in ¢* theory. 


Then we will construct the integrand I. 
The unrenormalized integrand is 


Subtraction of the sole subdivergence gives 
1 1 1 
(I? — m*)* [(k + p3) — m°] [(k — p4}? — m°] 
_ — 2k-1— k? 
(P — mP [Kk +D? — m7] [(k + ps)? — m7] [(k — py)? — m] 
(5.10.6) 


R(U)=U— 
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Then the overall divergence is subtracted to give 
I = R(U) = R(U)— [R(U)] l =p2=p3=p4=0 


_ (2k:l + k?){(k? — m?)[ pj + p3 + 2k-(p3 — pa)] + (p3 + 2k-p3)(p3 — 2k: p4)} 
(I? — m°)? (k? — m?)?[ (k + 1)? — m7] [(k + p3)? — m7] [(k — p4)? — m?] 
(5.10.7) 
The BPHZ scheme has a number of advantages: 


(1) It is applied to the integrand and generates a convergent integral 
without requiring any regularization. 

(2) Thus it exhibits the fact that the properties of a renormalized field 
theory do not depend on which UV regulator is used. 

(3) Mathematically it is rather elegant. In particular there is no need to 
discuss directly the divergences of Feynman graphs; it is only required 
to have a theorem that tells us that a graph that is convergent according 
to the naive criteria is actually convergent. 

(4) It allows a very simple proof of the operator-product expansion. 


There are a number of disadvantages: 


(1) It is not the best scheme for theories (especially gauge theories) with 
complicated symmetries, where relations between counterterms have to 
be preserved ; the scheme does not allow direct computation of the value 
of a divergence. 

(2) The subtractions are made at zero momentum and therefore are infra- 
red divergent in a massless theory. 

(3) When the scheme is generalized to handle massless theories, it becomes 
much more complicated (Lowenstein, Weinstein & Zimmermann, 
1974a, b). 


5.11 Minimal subtraction 


5.11.1 Definition 


It can be proved (Speer (1974) and Breitenlohner & Maison (1977a, b, c)) 
that, when dimensional regularization is used, the UV divergences of 
Feynman graphs appear as poles at isolated values of the space-time 
dimension d. Minimal subtraction (t Hooft (1973))— the MS scheme — 
consists of defining the counterterms to be poles at the physical value of d, 
d = 4. We have already used this scheme, in Chapter 3. Our purpose in this 
section is to make precise the definition of minimal subtraction. 

The main complication is that bare couplings have a dimension that 
depends on d, so that we must introduce the unit of mass yp, as follows: 
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(1) Consider in turn the coefficient g; + ôg; of each term in Z. Let the 
dimension of g; be a; + b,(4 — d). Then we replace g; + 6g, by p*~® 
(g; + 6g,). Thus the renormalized coupling g; and the counterterm ôg; 
both have dimension a,, independently of d. 

Let I be a 1PI graph to which it is desired to apply a subtraction 

operator T. Let the dimension of I be A + B(d — 4), and suppose the 

couplings all contain powers of u as just explained. Then we define 
T(T) = p®¢- {pole part of (u?4~®T) at d= 4}. 

The pole part is obtained by making a Laurent expansion about d = 4. 

We have arranged to take the pole part of a function whose dimension 

does not depend on d. 

(3) Suppose we are talking about a theory in a different number of physical 
dimensions than four. For example, we might be in ° theory in six 
dimensions. Then the ‘4’ in the above formulae is replaced by the correct 
physical value. 


(2 


N 


For a simple graph with no subdivergences, like the one-loop self-energy 
in (5.10.1), this prescription amounts to subtracting the pole: 


gius 1 
Esd = 6) = imd E T(2 —d/2) | dx[m? — p?x(1 — x) ]4?/?~? 
d>6 2(4 ) 0 


—| pole = Ea es (m? — Sp”) 
128n° d/2 — 3 i 


~ 12873 Bes De 1—In (4r) ](m? — 4p?) 


+ l, dx[m? — p?x(1 — x) ] In eee (5.11.1) 


For graphs with subdivergences, the subdivergences must of course be 
subtracted before removing the overall pole. 
The advantages of the scheme are: 


(1) It automatically preserves complicated symmetries. The exceptions are 
chiral symmetries and the like, which in general cannot be preserved by 
quantization — see Chapter 13. 

(2) It has no problems with massless theories. In fact, dimensional 
continuation regulates both IR and UV divergences, thus removing the 
need for a separate IR cut-off. 

(3) Calculations are very convenient. 

(4) Computation of the divergent part of a Feynman graph — needed for 
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renormalization-group calculations — is almost trivial at the one-loop 
level. 


Some disadvantages of minimal subtraction are: 


(1) It is unphysical. 
(2) The proof of the operator-product expansion is made harder than in the 
BPHZ scheme. 


5.11.2 MS renormalization 


The MS scheme has found much use especially in work on QCD, where it 
has become standard. Another disadvantage that then appears is that 
minimal subtraction tends to produce large coefficients in the perturbation 
expansion. These are primarily due to the In (42) — yp ~ 1.95 term such as 
appears in (5.11.1). It has become conventional to work with a modified 


scheme, called the MS scheme (Bardeen, Buras, Duke & Muta (1978)). 
Here the u of the MS scheme is written as | 


e% \ 1/2 
u=i( $) ~ 0384. (5.11.2) 
Then we have, instead of (5.11.1), the cleaner form 
MS g? pa ree 
a 5 li m 


1 2 ee 
-| dx[m? — p*x(1 — x) ] In k (5.11.3) 
0 


5.11.3 Minimal subtraction with other regulators 


Minimal subtraction could also be applied with other UV cut-offs. For 
example, if a lattice of spacing a is used, then the singular a > 0 behavior of 
graph of degree of divergence 6 is 

a~°*[ polynomial in In (a) ]. 


One can therefore define T(T) as the singular part of T, with the general 
form 


Borax 56 Bmax] 
T(T)= > [in(ap)]’4o,+ } 2 aelin (aw PAs, (5.11.4) 
B=1 a=1 p= 


After subtraction of subdivergences, the coefficients A, , are polynomials in 
masses and momenta. Note again the appearance of a unit of mass. This 
scheme has found little use. 


6 


Composite operators 


In Chapter 2 we met a number of equations involving products of field 
operators at the same point. Examples are given by the equations of motion 
(2.1.10) and the Ward identities (2.7.6). These products we will call 
composite operators. When computed directly they have ultra-violet 
divergences: the product $(x)¢(y) makes unambiguous sense if x is not 
equal to y, but if x equals y then we have ¢$(x)?, which diverges. Since the 
equations of motion and the Ward identities express fundamental proper- 
ties of the theory, it is useful to construct finite, renormalized composite 
Operators with which to express these same properties. 

It could be argued that there is no need to have renormalized equations 
of motion. One could say that one only actually needs the equations of 
motion in the regulated theory, where they are finite. A situation of practical 
importance where we actually do need renormalized composite operators is 
the operator-product expansion, to be discussed in Chapter 10. This is used 
in a phenomenological situation such as deep-inelastic scattering 
(Chapter 14) where we wish to compute the behavior of a Green’s function 
when some of its external momenta get large. Equivalently, we need to 
know how a product of operators, like @(x)@(y), behaves as x—> y. 

This information is contained in the operator-product expansion of 
Wilson (1969) which has the form 


(x) f(y) ~ C(x — yl + Cal- WOO)? J +--. (6.0.1) 


Here the symbol [ A(x) ] denotes the renormalized operator corresponding 
to an unrenormalized composite operator A(x). The coefficients C(x — y) 
are c-numbers, and each has a subscript which labels the operator that it 
multiplies. 

Therefore in this chapter we show how to renormalize Green’s functions 
of composite operators, e.g., 


<0|T d(x) b() 62(2)|0>, 
{01 T d(w)b()d(y)"b(z)?0, (6.0.2) 
<0| T(x)? (y)?|0. 
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We will first motivate the use of composite operators by seeing how the 
operator-product expansion arises in a low-order graph. Then we will 
examine the divergences that appear in low-order graphs for composite 
operators. We will see that we must expect multiplicative renormalization: 

[°] =Z,2¢’, (6.0.3) 
where [ $7] is finite as the UV cut-off is removed, while Z,. is a divergent 
renormalization factor. The unrenormalized operator (x)? is divergent 
when the cut-off is removed. 

These examples will provide motivation to define renormalized com- 
posite operators by application to Feynman graphs of the same R- 
operation that we defined in Chapter 5. After discussion of a number of 
technical issues, we will derive some basic properties of the renormalized 
operators, including the equations of motion and the Ward identities. 


6.1 Operator-product expansion 


We will postpone a complete treatment of the operator-product expansion 
to Chapter 10. Here we merely wish to motivate our definition of composite 
operators with an example of their use. 


q G+Pit+Po 
P: qt+DP, — P2 


(a) 


q Q+P,+P2 Y 
Pi =p 
oe l ' Q 


Fig. 6.1.1. Take q — œ in these graphs to obtain the lowest-order example of the 
operator-product expansion. 


Consider the graphs of Fig. 6.1.1 for the four-point function in ġ?° theory. 
We let q“ go to infinity with p, and p, fixed, and with the ratios of the 
components of q fixed. Then |q?|—> œ. We expand the graphs in powers of 
q? to find: 


| — 
Fig. 6.1.1(a) + (b) ~ aga lar (6.1.1a) 
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=y: ig 
Fig. 6.1.1 (c) ~ r n a ete 


á q (4°) 
(6.1.1b) 


In each term the dependence of p, and p, has factorized. We now show 
that this is a case of an operator-product expansion like (6.0.1) (after 
Fourier transformation into momentum space). 

In (6.1.1a) the factor in square brackets is in fact the value of the lowest- 
order graph for the following Green’s function: 


(0|TH(— p:)È(— p2)$(0)/210) 
= | d*x | d*yexp(—ip,-x— ip,-y)<0|T$(x)G(y)d?)/2|0>. (6.1.2) 


We have not Fourier transformed the @7(0) operator, but have set it at the 
origin. If we had made the Fourier transform, then we would merely pick up 
a momentum-conservation 6-function, which we do not have in (6.1.1a). 
To understand the appearance of the operator $7(0)/2 in (6.1.2), we may 
find a functional-integral formula for the Green’s functions that appear in 
this equation. Since 7(0) means the product of two fields at the same space- 
time point, such a formula follows from our work in Section 2.2. It is 


(0| T h(x) h(y)b*0)/2|0> = M | [dA] A(x)A(y)z4*(OJe*. (6.1.3) 


The Feynman rules for this Green’s function can then be derived. They 
are the usual ones for the Green’s function (0|T¢(x)¢(y)|0> with the 
addition that each graph contains exactly one special vertex for the @7(0)/2 
operator. The lowest-order graph is shown in Fig. 6.1.2, where the special 
*/2 vertex is indicated by a cross. The value of the vertex is unity, for the 
explicit factor 1/2 in @7/2 gets cancelled. This happens in exactly the same 
way as the 1/4! in the $* interaction or the 1/3! in the @° interaction gets 
cancelled to leave a value — ig for an interaction vertex. 

The operator #7(0)/2 is our first example of a composite operator (or 
composite field). By this term we mean, in general, a product of elementary 


/\ 


Fig. 6.1.2. Lowest-order graph for two-point function of $7. 
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fields (or their derivatives) at the same point. It is the properties of such 
operators that we will investigate in this chapter. 
We can write 
2i 


2 
Fig. 6.1.1(a) + (6) ~ zs 0|TB(p1)4( P2)4(0)7/2/0), (6.1.42) 


This is illustrated in diagrams by Fig. 6.1.3. 
In similar fashion, we derive an operator formula for (6.1.15): 


. ig? 2iq" ô (2m? + 4q"q”ð ð 
Fig. 61.116) ~ [it] 1 2 pte ee | 
x (0|T$(P1)$(P2)(x)|0>|,=0- (6.1.4) 


2 


2ig - 2 
(a) + (b)~—] i (c)~ = y coefficients x derivatives 
(q ) (q? 


Fig. 6.1.3. Generation of terms in the Fig. 6.1.4. Generation of terms in the 
operator-product expansion from the operator-product expansion from the 
graphs of Fig. 6.1.1(a) and (b). graph of Fig. 6.1.1(c) 


Here the form of the first square-bracket factor means that we need an 
elementary field (x), rather than a composite field. The p,’s and p,’s in the 
numerators in the second square-bracket factor have turned themselves 
into derivatives with respect to x; we set x = 0 at the end. Equation (6.1.45) 
is illustrated in Fig. 6.1.4. 

The form of (6.1.4) suggests the following formula: 


<0|T (4) (0) d(P,)(P2)|0> 
~ È C0) 0| T 0,0) $(P:)$(2)|0>. (6.1.5) 


The sum is over a set of local operators ©;. Each of these is either the 
elementary field ¢, one of its derivatives, or a composite operator such as 
$°. The coefficients C,(q) are called the Wilson coefficients. In Chapter 10 
we will generalize this result to all orders. We will have an expansion for any 
Green’s function with large momentum on some of its external lines. 

Now higher-order corrections to the Green’s functions of the composite 
Operators such as the ¢7(0) that appears in (6.1.4a) have ultra-violet 
divergences beyond those appearing in Green’s functions of elementary 
fields. We will see this in the next section, Section 6.2. To obtain an operator 
product expansion, like (6.1.5), with finite coefficients, we will need to 
renormalize the composite operators. This particular problem will occupy 
most of this chapter. 
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6.2 Renormalization of composite operators: examples 


6.2.1 Renormalization of o? 


The Feynman rules for Green’s functions of unrenormalized composite 
operators can be derived from the functional integral, in the presence of an 
ultra-violet cut-off. In coordinate space they are the usual rules, modified 
only by having several external fields at the same point. For example, we 
consider 


<0|T d(x) 6(y) 67 (z)/2|0> (6.2.1) 


in @° theory in six-dimensional space-time. The connected graphs up to 
order g? are shown in Fig. 6.2.1. As before, the vertex for 67/2 is denoted by 


a cross. 
(a) (b) (c) 


(e) 


Fig. 6.2.1. Renormalization of the operator ¢”. 


To work in momentum space, we Fourier transform, as usual, and define 


G = (0|TH(p) b(q) d(z)"/2|0> 
- | dxd4yexp[i(p, x + pr-y]<0| TH) 4(2)?/2|0) (6.2.2) 
The lowest-order graph, Fig. 6.2.1(a), is equal to 


l 1 

a= (p? — m + ie) (p? — m? + ie) (a 

Observe that the factor 1/2 in the operator 7/2 is cancelled, just like the 
1/3! that comes with the interaction vertices. 

Let us now turn to the one-loop graphs of Fig. 6.2.1. They are all 
divergent: Fig. 6.2.1(b) is logarithmically divergent, while the remaining 
graphs, Fig. 6.2.1(c) to (e), are quadratically divergent (all at d = 6). The 
divergences in the last two graphs, Figs. 6.2.1(d) and (e), involve self-energy 
corrections only, so these divergences are cancelled by the usual wave- 
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—e—— —K¥—e— 
(a) (b) 


Fig. 6.2.2. Counterterm graphs for Fig. 6.2.1(d) and (e). 


function and mass counterterms, Fig. 6.2.2. In these and other graphs in this 
chapter, we indicate counterterms by a heavy dot and an insertion of a 
composite operator by a cross. 
The remaining two graphs have no counterterm from the interaction, 
and they are both divergent. For Fig. 6.2.1(b) we get 
i l 


A E 
> (p? — m? + ie) (p2 — m? + ie) 


goaa : ea ae | 
K J (K? — m? + ie)[(k — py)? — m? + ie] [(k + pa)? — m? + ie] 
1 i g? i 
(p (p? — m?) (p2 — m?) m?) 64n zI (3 — 4/2) <| dx x 


0 


«|g k re ae 
ye ee ee 


; Anu? 
(6.2.4) 
and for Fig. 6.2.1 (c) we get 
— n,3-—d/2 
OSa ae a 2 2 * 
(py — m*)(pz — m*)[(p, + P2)° —m*] 
TRE  ) ae a ae 
2(2n)! (k? — m*)[(p, + pz +k)? —m*] 
= gp? 4! | 
= —aaaaaaaalulssssssstsulutlulsasiÃħi— X 
(pi —m’)(pz — m°)[ (p, + = - m°] 
d/2 -3 2 d/2 -2 
— gu m? — (p, + p)*x(1 — x)] 
Aes —=____T(2 — d/2) f axl 1 Fru . (6.2.5) 


Note that there is a symmetry factor 1/2 in this last equation. The fact that 
the sum of (6.2.4) and (6.2.5) diverges means that the operator ° (0) is not 
finite. 

For use in the operator-product expansion we do not need precisely the 
operator p°. Rather, we need some local operator similar to œ? that is finite. 
This indicates that we should define a renormalized operator by sub- 
traction of the divergences. Let us agree to use minimal subtraction. Then 
the counterterm graphs are obtained by replacing each divergent loop by 
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deh cade 


(a) (b) 
Fig. 6.2.3. Counterterm graphs for Fig. 6.2.1(b) and (c). 


minus its pole part, as illustrated in Fig. 6.2.3. Thus the counterterm graph 
for Fig. 6.2.1(b) is 
i? g? 
(pi — m*)(p3 — m°) 64r° (d — 6)’ 
and the counterterm graph for Fig. 6.2.1 (c) is 
3—d/2 d/2 -3 
— gu gu 2.1 2 
oe a alm - 40, + . 
P? — m?)(p3 — m*)[(p, + p) — m"] par So EE 1} 
(6.2.7) 
The positioning of the factors of u is such that the counterterm in curly 
brackets has exactly the same dimension as the loop to which it is a 
counterterm. 
We thus find the renormalized values at d = 6: 
iz g?’ y 1 
R(G,) == 9-5 L d 
a cee | xx 
a o m? — (piy + p3x)(1 — x — y) — (Pı + P2) xy |} (6.2.8) 
4nu? ? 


(6.2.6) 


OE: E ee 
RO) = BF mp} — mlo, + Pa)? — rene i 
1 
do — 1)[m? — ¿(p + p2}? ] + | dx [m? — (p, + p2}? x(1 — x)] X 
(0) 
zin [me | (6.2.9) 


Any? 


To interpret these renormalizations we observe that the counterterms are 
vertices for @?(0) in (6.2.6) and for (m? + [/6)@ in (6.2.7). Thus 


G, + R(G,) + R(G,) + R(G,) + R(G,) 
7 g? y y 1 
= [ + arag l T (P;)b(P2)+* (0)|0> 


d/2-3 rn í 
arg T $(P1)$(p2)(m? +40)40)0> + 0(g*). 


(6.2.10) 


+ 
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So what we are computing is a Green’s function of the operator 


irota g? 142, 9H 
al¢ Jef: targ le + ondo) 


d/2—3 


(m? +é())¢ + higher order. 
(6.2.11) 


We use the square brackets on the left-hand side to denote a renormalized 
operator. In subsequent sections we will see that this result generalizes to all 
orders: a renormalized operator [¢7] can be defined to all orders by a 
formula of the form: 


2L¢*] = 2,29? + u”? Zym o + ut? Oo, (6.2.12) 
where the Z’s depend only on g and d. 

Such renormalized operators were defined by Zimmermann (1973a). He 
called them normal products and used the notation N [7]. His definition 
differed from ours only in that he used BPHZ renormalization instead of 
minimal subtraction. l 

Observe that in order to obtain a degree of divergence of at least zero, the 
operators that appear as counterterms in (6.2.11) or (6.2.1 2) have dimension 
less than or equal to that of the original operator 7. This is a general 
phenomenon. Moreover, the only operators of such dimension are those 


actually appearing in (6.2.12). We may write the renormalization as a 
matrix equation in the following form: 


alp ]\| [Za wi? *Zym* PZA 20 
@ į|=ļ|0 1 0 p |. (6.2.13) 

Oo 0 0 1 o 
Here we have used the fact that @ and [O ¢ are finite. The operators ¢ and 
Oo are said to mix with $7 under renormalization, because the off- 
diagonal elements Z, and Z, are non-zero. Moreover, no further operators 
are needed in the renormalizations, so ¢*, ¢ and (1¢ are said to form a 

closed set under renormalization. 


6.2.2 Renormalization of 67(x)7(y) 


Sometimes we need Green’s functions involving two or more composite 
operators. A simple example is 


<0|T3[6?](p)3[ G7 ](0)|0> 
= [tse <0] TEL47] 3) §L47100)]0>, (6.2.14) 


for which the lowest-order graph is Fig. 6.2.4. The renormalizations of 
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=€ > 
p 
Fig. 6.2.4. Lowest-order graph for <0|T[¢7]/2[¢7]/20>. 
(6.2.11) do not appear until the next order, so the graph has the value 
— = | | ee (6.2.15) 
2(2n)4 (k? —m?)[(p + k)? — m7] — 
This is ultra-violet divergent, even in free-field theory. Even though the 
operators @*(x) and 7(0) on the right of (6.2.14) are well-defined, we have to 
integrate through the point x = 0. At x = 0 there is a singularity, which is 


not integrable if d >4. We may nevertheless define a finite Green’s function 
by adding a local counterterm: 


COI T3[.67](x)5[7]O]0>z = <0| T3[ 7] (x)5[ 7] 0)|0 


~ C(x) <0|1]0). (6.2.16) 
with 
C(x) =(m? + EVA OOT =e) + O(g°) (6.2.17) 
64r? (d — 6) l a 


Once more we have used minimal subtraction at d= 6. 


6.3 Definitions 


We define renormalized Green’s functions of composite operators by 
applying the R-operation to the Feynman graphs, just as we did for Green’s 
functions of elementary fields in Sections 5.3 and 5.5. We will need to show 
that the counterterms generate multiplicative renormalizations of the 
operators (e.g., (6.2.12)). This is similar to what we did in Section 5.6, where 
we showed that the counterterms in elementary Green’s functions are 
generated by counterterms in the Lagrangian. Our motivation for starting 
with the graph-by-graph renormalization is again to allow a simple 
treatment of the problems of subdivergences. We do not need a new proof 
that the counterterms for operator insertions are local; our original proof 
suffices. 

As before, we have a choice of many renormalization prescriptions. The 
ones that are most useful for subsequent developments are the BPHZ 
scheme — see Zimmermann (1973a)— and minimal subtraction — see 
Breitenlohner & Maison (1977a, b, c) and Collins (1975b). In any case we 
have a subtraction operator T(G) which is applied to a graph G (after 
removal of subdivergences) in order to extract the divergent part of G. In the 
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BPHZ scheme, T(G) gives the first 6(G) terms in the Taylor expansion about 
zero external momentum, where ô(G) is, as usual, its degree of divergence. 

In the case of minimal subtraction we must state how the unit of mass pis 
treated. As can be seen from the examples in Section 6.2, we must always 
arrange to compute the pole part of a quantity whose dimension does not 
vary with d. So suppose we need 7,,.(G) for a 1PI graph G of dimension 
u(4~ 40). Then we define 


T= pët {pole part of Guo" 9}, (6.3.1) 


where, as usual, dọ is the physical space-time dimension. 

The definition of a renormalized Green’s function by the R-operation is 
rather abstract, and we will now show that it amounts to adding 
counterterm operators. In the one-loop examples of Section 6.2 this was 
rather obvious. In the general case, we start from the formula for 
renormalization of an arbitrary Green’s function (see (5.3.7)—(5.3.9)): 


R(G)=G+ C/G). (6.3.2) 


Here, as usual, the sum is over all subgraphs y that consist of a set of disjoint 
IPI subgraphs y,,...,y,- Each y; is replaced by its overall counterterm 
vertex, generated as in Section 5.3. 

Let us distinguish the various ys that occur, according to the 
number of composite operator insertions that they contain. Consider, 
for example, Fig. 6.3.1, which illustrates the renormalization of 
<0|Td o 7/2|0> in the ø? theory in six dimensions. There is a one-loop 
subgraph y, for the three-point function. This is renormalized by its 
counterterm C, in the Lagrangian. There is also a two-loop subgraph y, 
which contains the composite operator vertex. (This has a subdivergence, 
which must be subtracted.) The counterterm C,, can be considered as 


where 
~~ Sial 
| | 
C,, = i | 
| CER ee Speer 
ae mame r-a 
{ 
ae -i a | 
Ye | | : 
eee eee 


Fig. 6.3.1. Renormalization of two-loop graph with insertion of composite operator. 
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Fig. 6.3.2. Renormalization of three-loop graph for ¢0|T[ 7 ]/2[ 67 ]/2|0>. 


generated by an O(g*) term in the renormalization factor Z,. 

Next consider Fig. 6.3.2, a graph of order g* for the Green’s function 
<0| T[¢7]/2[67]/2|0>. In addition to an overall logarithmic divergence, it 
has three divergent subgraphs. One subgraph y, is renormalized by a 
counterterm in the interaction Lagrangian. The other two subgraphs y, and 
yą each look like y, of Fig. 6.3.1, and are renormalized by the same 
counterterm. These two counterterms are generated from 

(0|73[ 67 ](x)3L¢7]0)|0> = Z7<0|73673¢7|0 
+ other terms from Z, and Z,, (6.3.3) 


by expanding each Z, to O(g*) and picking out the terms for y,. Finally, the 
overall counterterm is obtained. It gives a term of O(g*) in the C(x) of 
(6.2.16). 

These arguments generalize easily to arbitrary graphs. It suffices to 
consider a renormalized Green’s function of one composite operator 


N 
O/T [T $)LAO)]|9>. (6.3.4) 


We define the renormalization by the recursive formula (6.3.2), and we wish 
to prove that this equals 


2 Za06O|T [] $(x,)By)|0>. (6.3.5) 


Here Z,, is a renormalization factor whose value is given by writing, in 
analogy to (5.6.5), 


1 
[A] = } ZasB = X} D(C). (6.3.6) 
B G G' 


Here Gis any 1PI basic graph that includes a vertex for A. It has Ng external 
lines in addition to the vertex for A, and D(G) is the operator corresponding 
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to the overall counterterm C(G). The factor 1/N,! is just like the 1/N(G) in 
(5.6.5), to organize the symmetry factors. 

Consider a graph for (6.3.4). We investigate one of its counterterms C,(G). 
The subgraph y consists of 1PI graphs y,,...,y,. A y; that does not contain 
the vertex for A is replaced by C(y,), which corresponds to one of the 
counterterms in Z. A y, which does contain the vertex for A must be one of 
the G’s that are summed over in (6.3.6), so the counterterm C „ 18 generated 
by one of the counterterm operators on the right of (6.3.6). 

Now sum over all graphs G for our Green’s function and expand each 
R(G) by (6.3.2). The sum over 1PI subgraphs y, that correspond to 
counterterms in Z can be done independently of the sum over the 
subgraphs giving the counterterms for the operator vertex. The result is 
then the desired result (6.3.5). 


6.4 Operator mixing 


We have seen that a renormalized composite operator [A] is expressed in 
terms of unrenormalized operators by 


[A] = YZ 5B. (6.4.1) 
B 


In the case of [ġ?] we saw that the operators that were needed as 
counterterms had the same or lower dimension. Let us now demonstrate 
this for the general case. 

The proof is essentially dimensional analysis. Let G be a 1PI graph 
containing a vertex for A and having the same number JN, of external lines 
as a particular operator B. Now B is a product of N, fields with a certain 
number D, of derivatives. A counterterm can only be generated if the degree 
of divergence 6(G) is at least D,. Now in a renormalizable theory all 
couplings have non-negative dimension, so 


6(G) = dim (G) — dim (couplings) 


< dim (G). (6.4.2) 
On the other hand, since Z,,B is a possible counterterm we have 
dim (G) = D, + dim (Z 4p). (6.4.3) 
But we only need B as a counterterm if Dz < 6(G), so 
dim (Z 48) 20 


for every counterterm. This means that the maximum dimension of a 
counterterm operator is the dimension of A, as we wished to prove. 
In Section 5.8.1 we examined the dependence of counterterms in the 
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Lagrangian on mass parameters. Provided we used minimal subtraction 
this dependence was polynomial, with the mass behaving as a dimensional 
coupling in determining allowed counterterms. The same argument applies 
here. The result is that Z,, 1s a polynomial in masses (and super- 
renormalizable couplings) times the inevitable power of the unit of mass p. 
The coefficients of the polynomial are dimensionless functions of the 
dimensionless couplings and of d. A typical example of this is given by our 
calculation in Section 6.2 of the renormalization of [7] — see (6.2.11) and 
(6.2.12). 


6.5 Tensors and minimal subtraction 


Suppose we use minimal subtraction to define [(é)*] and [0,60,]. It is 
tempting to suppose that 

[(6¢)*] =9""[6,0,4]. (6.5.1) 
This supposition is in fact false, as we will now demonstrate. This means 
that the taking of a trace does not commute with taking a finite part, in 
general. We will explain the significance of this fact. 


The lowest-order graph (in ¢° theory) for either operator is Fig. 6.5.1. 
The 1PI part for 67/2 before renormalization is 


igw i k-(k +p) | 
G(p) = 2(2n)4 fa wT —m’)[(p +k)? — m7] 
gn 1 m? — p° x(1 — x) d/2 -3 
12873 |. ax] Any? l . 
x {PF (2 —d/2)p?x(1 — x)[m? — p*x(1 — x)] 
+ (d/2)T (1 — d/2)[m? — p*x(1 — x)]?} (6.5.2) 
For 0,@0,@/2 we have 
tt , ki(k+ py 
OPS An JA UE = mL @ + m] 


d/2-3 f1 ee ee d/2 -3 
_ gH Pe ca a x) 7 
1287? Jo Anu 


x {T'(2 —d/2)p,p,x(1 — x)[m? — p*x(1 — x) ] 
+ 39,1 (1 — d/2)[m? — p*x(1 — x) ]*}. (6.5.3) 


k+p k 


Fig. 6.5.1. One-loop graph for the operators in (6.5.1). 
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Manifestly 
g"G,, =C (6.5.4) 


for the unrenormalized Green’s functions. This has to be true since 


êp? = 9""0,60,0. 


We can renormalize by minimal subtraction. 


REG] = Gip) -2 —— [3m + $m2p? — 1p], (6.5.50) 
í P — 28nd — 6) a a 
gu??? 1.2 1.2 


— g,,(4m* — imp? + gop") |. (6.5.5b) 
Thus 


d/2-3 


gu 
R[G(p)] — g’’R[G,,(p) ] = — Taga Lam" — gmp? + gop*]. (6.5.6) 


The reason why contraction with g’*” does not commute with the 
subtraction of the pole is simply that taking the trace introduces d- 
dependence. Thus: 


g d 
py 2 
g” pole part oF A ae 
d 6 
pole part of 5 Se Sus = pole ae oe 


We must evidently be careful to specify whether a trace is inside or outside 
of the renormalization. The need to do this is characteristic of dimensional 
regularization. Which place to put the trace depends on the problem under 
consideration. 

The problem arises whenever we have to consider a tensor of rank at least 
2. (It could also arise in connection with taking a trace of Dirac y-matrices 
except that we choose the trace of the unit Dirac matrix to be independent 
of d.) We have discovered that our renormalized operators do not have all 
the properties that the bare operators do. The lack of commutativity of the 
trace and the finite-part operation is related to a physical effect, that there is 
an anomaly in the Ward identity for scale transformations — see Callan 
(1970), Symanzik (1970b) and Brown (1980). 

If we were to use, say, Zero-momentum subtractions (BPH or BPHZ), then 
the trace and the finite-part operation would commute — as can be checked 
from our example. So it might appear that zero-momentum subtraction 
provides a better all-purpose definition of renormalized operators than does 
minimal subtraction. However, some of the properties we will prove when 
using minimal subtraction now disappear or become more complicated. 
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For example, the equations of motion which we will prove in Section 6.6 are 
only true if the mass terms are oversubtracted. This turns out to prevent 
some Ward identities from being true when the simplest renormalized 
operators are used, whereas they are true in their simplest form when using 
minimal subtraction. We will prove this in Section 6.6 also. The moral is 
that one cannot completely eliminate the problems. 

It is possible (Collins (1975b)) to construct a definition of, say, 
[3 p, p/2] which uses minimal subtraction and for which g*"[¢,60,6/2] = 
[(6@)*]. This is done by writing tensors in terms of Lorentz-irreducible 
components. Thus we write a second-rank tensor M_, as the sum of an 
antisymmetric term, a symmetric traceless term, and an invariant term: 


My = 2(M,, — M,,) + 2(M,, + My, — (2/09. My) + (1/4) 9. Me 


The subtraction procedure is applied to each term separately. This definition 
loses other properties of the renormalized products. For example, con- 
servation of energy and momentum is a consequence of the fact that the bare 
energy-momentum tensor 0, has zero divergence: ĉ“0,, = 0. If we define a 
renormalized energy-momentum tensor [0,,] by our original definition of 
minimal subtraction, then this is the same as the bare @,,, up to allowed 
redefinitions and it is conserved. But if we construct [@,,,] by the procedure 
just suggested, then it is not conserved. 


6.6 Properties 


One of our motivations for working out the theory of renormalization of 
composite operators was that in Chapter 2 we had proved equations of 
motion and Ward identities. These results involved unrenormalized 
composite operators. So now that we have defined renormalized composite 
Operators, we must prove the equations of motion and Ward identities 
expressed in terms of these renormalized operators. This is particularly 
important for the Ward identities, for these express the symmetry properties 
of the theory. 

In this section we will derive a number of useful properties of the 
renormalized operators. Some properties will be purely technical, while 
others will be the actual equations of motion and Ward identities. Our proof 
will be given for the case that the operators are renormalized by minimal 
subtraction. A typical proof of some equation starts by observing that the 
corresponding equation is true for the unrenormalized operators. 
Renormalization is almost the same procedure applied to both sides of the 
unrenormalized equation, so the main problem is to find the places where the 
renormalization procedure is not identical for the two sides. 
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It is always possible to make the theorems false by changes in the 
renormalization prescription. The point of using minimal subtraction is that 
it is a universal prescription that preserves almost all of the desirable 
properties. (The reason is that it amounts, roughly, to defining each 
counterterm by the requirement ‘remove exactly the singularity’.) These 
properties are relations between different operators. 

The other standard renormalization prescription that preserves most of 
these relations is the BPH (or BPHZ) method of zero-momentum 
subtraction. In fact, the proofs were first given using the BPHZ prescription 
(Zimmermann (1973a, b), Lowenstein (1971) and Lam (1972)). However the 
use of minimal subtraction is better for gauge theories because of their infra- 
red singularities. The proofs were given in this case by Collins (1975b) and 
Breitenlohner & Maison (1977a, b, c). All these works are rather technical. 
However, the basic ideas are simple. 


Property 1. Linearity: 
a[ A] +b[B] = [a4 + bB], (6.6.1a) 


where aand bare pure numbers, while A and Barecomposite operators. This 
equation is to be interpreted as an equation for Green’s functions of the 
operator. That is, if X is any product of renormalized operators (elementary 
or composite), then 


a<0|T[A]X|0> + b<0|T[B]X|0> = <0] T[ad + bB]X|0) (6.6.1b) 


Proof. This property is almost obvious. If A and B have different numbers 
of external legs (e.g. p° and ¢*), then there is no simple way of defining the 
right-hand side of (6.6.1) except as being the left-hand side. But if A and B 
have the same fields, like (6g)? and $7, then the Feynman graphs for 
<0|TAX|0>, <0|TBX|O> and ¢0|T(aA + bB)X|0) are the same; the 
differences are only in the placement of powers of momentum. The equation 
corresponding to (6.6.1b) is true for the basic graphs (i.e., without 
counterterms). To obtain the renormalized Green’s functions we apply the 
forest formula to each graph. The terms in the forest formula are the same, 
since the graphs for the three Green’s functions are the same. Then (6.6.1) 
follows from linearity of the subtraction operators T,,. 


Comments. (1) Itis necessary to be pedantic about this proof because:(a) itis 
a prototype for less trivial cases, and (b) it fails for the case of zero-momentum 
subtractions. The reason for the failure is that the T, operation is then not 
linear. For example, [ (6@)? |] and [(6¢)* + m? p? ] need two extra subtractions 
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compared with [7], because the degree of divergence is two higher. So we 
can only have 


O [OH Tap +m lo’ Jee = (06)? + mo’ lern (6.6.2) 
if the [67] operator is oversubtracted. 

(2) The coefficients aand bin (6.6.1) must be independent of d, for otherwise 
taking a pole part is non-linear. Furthermore, we cannot use (6.6.1) to show, 
for example, g” [3 „pð, p] = [(0)*]. As we saw, this equation is in fact false. 
The proof fails because it can only be applied to the case that we sum over a 
finite number of operators. It does not automatically apply to infinite 
summations. However, when we defined dimensional regularization in 
Chapter 4, we saw that our vectors and tensors have to have infinitely many 
components. 


Property 2. Differentiation is distributive. Let A be the composite operator 


A= I] (x) 


where each ¢, is an elementary field or one of its derivatives. Then 


Z a= | ]- 3 Pace | (6.6.34) 


i=1 jti 


Again this equation is to be interpreted for Green’s functions: 
ð 
aa <0 T[A(x)]X|0> = <0| T[0A/0x"|X|0). (6.6.3b) 
x 


Proof. Let p“ be the momentum leaving the Green’s function (6.6.35) at the 
vertex for A. Then the derivative 0/dx" gives a factor — ip”. The point of (6.6.3) 
is to state that we get the same results whether or not we take p” inside the 
finite-part operation. To prove the equation, it is enough to observe that this 
statement is true for the basic subtraction operator T,,. 


Comments. (1) We can contract u with an index in A. Thus we have 


go [pap] =9""L6,(6,9)] = [Op] + [O04]. (6.6.4) 
Since the overall derivative merely gives a factor — ip, there is no possibility 
of introducing extra d-dependence by contracting with g“*. This is in 
contrast to the case considered in Section 6.5. 
(2) Note that derivatives are always implicitly taken outside of the time 
ordering. Thus: 


0 
<0|T($0,6)(x)297(y)|0> = lim ra T $(2)6(x}36°())|0>. (6.6.5) 
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This gives the simplest Feynman rules in momentum space, with each 
derivative of a field giving a factor of momentum on the corresponding line. 
The lowest-order graph for the Fourier transform of (6.6.5) is 


| dxd’ yei? =+ia >< 0| T(p0,4)(x)367(y)|0> 


PE d*k ik, Et 
TONS Pta jarem- OY 


Property 3. Simple equation of motion. Decompose the action into a basic 
action and a counterterm action: 
Sf =S t Sai (6.6.7) 


just like the decomposition (5.1.1) of the Lagrangian, except that S „includes 
both the free and interaction terms: S, = {d*x(%_) + Y,). Then define 
functional derivatives with respect to renormalized fields 


OS OL OL 


P(x) =—— = — 0, 6.6.8 
ol) f(x) p(x) “00,0(x) ( ) 
SY\(x) = sh = F¥, with counterterms omitted. (6.6.9) 
We already know the unrenormalized equation of motion (2.5.5) 
(OTF (0X0) =i5 cos (6.6.10) 
Now we wish to prove the renormalized — 
<0| TL A3(x)]X|0> = (6.6.11) 
from which follows the operator equation 
[5] =0. (6.6.12) 


Comments. (1) The functional derivatives are to be treated in a purely formal 
sense. 

(2) Even though [/,(x)] is zero as an operator, its Green’s functions 
(6.6.11) are non-zero because we define them by taking derivatives outside 
the time-ordering. Bringing them inside gives equal-time commutators 
(Section 2.5). 

(3) Signs for fermions are easiest to determine by examining the derivation 
in Chapter 2. 
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Example. In the °? theory 
L = Z(d$)?/2 — mZ $?/2 — goZ? G6, 


we have 
S 4 = —-Z0b — moZo — $902" ", (6.6.13) 
[F] = —-Od— mo -au glo" ]. (6.6.14) 
Then cases of (6.6.11) are 
CO TEF] =- 0, — m’)<0| To(x)b(y)|0> 
= so “*9<0| TL*(x) ]o(y)|0> 
e y), (6.6.15) 


CO TL S30) J[O7(y)]b(2)b() 0. 
= 316° (x — y)<0| TEH Y) ] (2) 6() [0 > 
+iô®(x — z)<0| TL $*(y)]6(w)|0> 
+ 16 (x — w)<0| TL °(y) ]o(z)|0>. (6.6.16) 
Proof. A general proof of (6.6.11) is rather complicated because of the 
arbitrary number of fields. To show the main points it is sufficient to prove 
one case, (6.6.15) in @° theory. The problem in proving the renormalized 
equation from the unrenormalized equation is that the Feynman graphs are 
different for the different terms. 
We write 
PF pg TF iid 
= —(0 + mo — igw te’. (6.6.17) 
Examples of low-order graphs are given in Figs. 6.6.1 and 6.6.2. The 
counterterms are those arising from the action. 
In momentum space we evidently have 


COIT Fo, (PPO) = (p? — m?)<0| TA(p)G(q)|O>. (6.6.18) 


COIT Fo, 4 H(p)|0> = (p? — m’) [—+ -> —) 


Fig. 6.6.1. Low-order graphs for (6.6.15) with free part of action. 
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OTL in e1GP)|0>= O+ (O +O-) 
TOROS 
For 


Fig. 6.6.2. Low-order graphs for (6.6.15) with interaction part of action. 


The p? — m? multiplying the free propagator attached to ẹ(p) cancels the 
denominator of the propagator. Thus in the graph where the other end of the 
propagator is $(q) we obtain the right-hand side of (6.6.15). 

In all the remaining graphs the other end of the propagator is an 
interaction vertex, either a basic interaction or a counterterm. In fact we 
obtain 


COIT P n g(P)P(Q)|0> (6.6.19) 


where 
F inr.g(X) sy S » E S 0.6 

= —(Z—1)0¢ — (mZ — m?) —499Z776?. (6.6.20) 
This is exactly what we must obtain in order that the unrenormalized 
equation of motion (6.6.10) is true. Notice that because (6.6.18) is 
finite, so is (6.6.19). We must now prove that the counterterms in 
(6.6.20) are precisely those that are needed to give the operator 
[Sig] = —¢gu°“?[p7] renormalized according to our standard pre- 
scription for composite operators. 

Now, the renormalization prescription is precisely to add to the basic term 
Sito = —6gu “ob? a series of counterterm operators whose coefficients 
are pure poles at d = 6, so as to make its Green’s functions finite. But this is 
precisely (6.6.20). The relation between the counterterms can be seen from a 
comparison of Figs. 6.6.1 and 6.6.3. 


(OTF ngpo) = (O+ —) 
+ (OO +O Oa) 
+ (<D-+ O-+O-+—} 


+ eee 
Fig. 6.6.3. Renormalization of Fig. 6.6.2. 
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Property 4. Equation of motion times operator. With the same notation as 
before we have 


£0] T[AFj(x)]X|0>_ =i60|TA(x)———X|0>p, (6.6.21) 


5565 ) 


where A is any product of operators at the same point. Hence 
[AS,] =0. (6.6.22) 


Comments and examples. (1) This property is crucial to proving Ward 
identities. 
(2) All operators appearing on the right of (6.6.21) are to be renormalized. 
(3) In ¢° theory, cases of (6.6. y are 
COTL- 06 -m p? — 3967] (x)P() O(z)|0> 
= iô(x — y)<0|T noe [0> + (yez), (6.6.23) 
COTE- 09 -mp — igo IOL] r 
= 2iô(x — y)<0| T[$°]() 6(z)|0> 
+ id(x — z)<0| TL 7] 0) [7] @|0r. (6.6.24) 
Proof. The unrenormalized version of (6.6.21) follows almost directly from 
the previous property in its unrenormalized version. Thisin turnfollowsfrom 
the functional-integral solution of the theory, as shown in Section 2.5. We 
have 
(0| T A(x) F 4(x)X |O> 
7 oA) 
p(x) 09x) 


Then we use the fact that in dimensional regularization 


= i(0| TA(x) 20) + 1¢0| TA x0). (6.6.25) 


6 (0) = {eto = 0, (6.6.26) 


according to the results in Chapter 4. This enables us to eliminate the 
6A(x)/0g(x) term. 

The renormalized equation of motion (6.6.21) can be proved by 
generalizing the method for the previous property. It is enough to consider 
the example (6.6.23). Low-order graphs for the left-hand side of (6.6.23) are 
shown in Fig. 6.6.4. The (— O — m°) factor in Sy p cancels an attached 
propagator. If the other end attaches to an external field (viz., p(y) or #(z)), 
then we have a contribution to the right-hand side. If it attaches to an 
interaction then the negative of a contribution with £ „1s obtained, such as 
Fig. 6.6.5. Since these manipulations do not change the one-particle- 
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COITOS 0, J A(KG(Q)19 


k 
= (qm?) K + (=m) 


p+k+q 
+ {(p? —m’) + [(p + k +q} —m’]} O< 
p 
vesem [KO + <b 


Fig. 6.6.4. Low-order graphs for the left-hand side of (6.6.23), using free action. 


COITL 9S nelke) 


+cts 


Fig. 6.6.5. Low-order graphs for the left-hand side of (6.6.23), using interaction part 
of action. 


(ir)reducibility structure, renormalization can be performed without chang- 
ing the result. 
Since the Sg , terms need renormalization two subtleties arise: 


(1) Since O ġ = g""0,f0,¢, the ambiguity about the placement of g*” is 
relevant. To preserve the derivation it must come before the re- 
normalization is performed. 

(2) Inthe BPHZ scheme m*¢? must be oversubtracted otherwise we cannot 
use linearity to combine — op O ġ and —m7¢’. 


The case of (6.6.24) involves two further subtleties illustrated by Fig. 6.6.6. 
In the first graph (a) the ( — O — m?) multiplies the line coming back toa (x) 
factor. This term gives zero after use of (6.6.26). The second graph (b) has two 
1PI loops separated by a line. In the basic graph, the q? — m? factor cancels 
the propagator to give graph (c), which has a different reducibility structure. 
The first two counterterm graphs give the obvious counterterm graphs in 
Fig. 6.6.6(d). These correspond to the first two counterterms in Fig. 6.6.6(b). 
But the last counterterm in (b) has two vertices while the corresponding graph 
(e) has one vertex. Their operator structure is different: graph (e) is another 
counterterm to renormalize the x > y singularity of 6*(x) ¢°(y). Even so the 
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(k? — m°) z 


la) em y 
y 

b) (q?—m?) Lo —O— 
ones 


tnt 
y 

s BE. y 

@ xy QO, 


(e) Oe aaa 


Fig. 6.6.6. Some graphs for (6.6.24). 


two counterterm graphs must be equal. They are both a single free 
propagator times a pole part coefficient times a polynomial in momentum 
times the same power of the unit of mass. They both make the complete 
Green’s function finite. 

The general proof is rather tedious and can be found in Collins (1975b). 
This proof was given for the minimal subtraction scheme, but also works for 
the BPH(Z) scheme. In the original BPHZ proof, by Lam (1972), of (6.6.21), 
there is no treatment of this complication, that the counterterms for the two 
sides are not in manifest correspondence — i.e., that the forests are different. 


Property 5. Ward identities: We will use the notation of Section 2.6 for 
transformations under potential symmetry operations. Let 


Q id Q JT op j 
be an infinitesimal transformation of the fields under which the basic 
Lagrangian transforms as 


Li S pume One ô, Y s 


We are restricting our attention to the transformations generated by one 
particular generator of a group. Thus, as compared to Section 2.6, we now 
drop the index ‘e’, which labelled the generators. The subscript ‘b’ on A, and 
Y, indicates that we areconsidering transformations on the basic Lagrangian 
(i.e., without counterterms). In the equations below, we will add in the 
counterterms by use of our standard renormalization scheme. Note also that 
in setting up the renormalized Green’s functions, in Section 2.8, we defined a 
free Lagrangian Y, and a basic interaction Lagrangian #,. We now work 
with their sum: Z pasi = Lo + Lr- 

We proved earlier the unrenormalized Ward identity (2.7.6). The 
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renormalized Ward identity is 
ô , 
| T Li§(x)] X|0>p 


= (0|T[A,(x)]X|0>, — Korido e AD. (6.6.27) 


p(x) 


We will have to prove it. From it follows 


ô Lb] = [Ae] (6.6.28) 
and by integration over all x: 


0= faxo T[A°(x)]X|0)_ —i5(0|T XO). (6.6.29) 


Of course A = 0 for a symmetry. The current is 


dais . 
[ji] = 5° 30,0, so |-LYA] (6.6.30) 


Proof. In defining Y{ and A,, we have used the basic Lagrangian, i.e., the 
one with the counterterms omitted. This is because we use the operation 
symbolized by square brackets to generate the counterterms. The proof of 
(6.6.27)—(6.6.30) follows the usual proof of Noether’s theorem, but using the 
previously proved properties to write it directly in terms of renormalized 
operators. 

First we use linearity and distributivity to obtain 


— [A,] + [3 j$] = 2 Ô (m )36,- rs | +0, ¥ 5 OL sasiell 


= basic OL vasic 
=r [ate a Ea ô$; 6, 


A X KEAN! (6.6.31) 


From this the Ward identity (6.6.27) follows by the equation of motion 
(6.6.22). 

We exchanged the order of renormalization and tracing over p to write 
ô Lig] =[¢,7¢]. This is permitted — see our remarks below (6.6.4). 


Comments (1) The theorem appears to give an unrestricted proof of the 
renormalized Ward identities. This appearance is false, since there are 
symmetries that can and often do have anomalous breaking — see 
Chapter 13. Such symmetries are dilatation and conformal symmetries and 


162 Composite operators 


chiral and supersymmetries. The potential for such anomalies can be seen by 
computing A” in a regulated theory ; it contains a non-zero coefficient which 
vanishes as d— 4. Such is the case for conformal transformations and for 
chiral symmetries (where the transformations involve y5 or é,,,, explicitly). 
Minimal subtraction is then not easily applicable and the properties we used 
in the proof are false. 

(2) Corresponding problems appear with any other regulator and with 
any other renormalization scheme (Piguet & Rouet (1981)). Minimal 
subtraction confines the problems to cases with anomalies. 


Property 6. Non-renormalization of current: Consider an exact internal 
Symmetry (such as the symmetry that gives electric charge conservation). 
Compute the corresponding unrenormalized current j“ from the complete 
Lagrangian. Now j“ contains counterterms derived from the counterterm 
Lagrangian. We will now prove that these make j“ finite and that 


p= Eel. (6.6.32) 


Comments This theorem does not apply to space-time symmetries — see 
Callan, Coleman & Jackiw (1970), Freedman, Muzinich & Weinberg (1974), 
Collins (1976), Brown & Collins (1980) and Joglekar (1976) for the case of the 
energy-momentum tensor. It also cannot be extended to the case of a non- 
conserved current unless the breaking term has dimension below that of Z 
(Symanzik (1970a)). Furthermore, the proof does not apply directly if the 
transformation d¢@ ,is non-linear in @ ,. It also needs generalization for gauge 
theories. 


Proof. Both j“ and [ j¢ ] consist of the basic current jë plus some minimal 
subtraction counterterms. The difference 


e = jt — [jE] (6.6.33) 


is a series of pure pole terms, and we wish to prove it vanishes. Each term has 
dimension 3 or less (at d = 4), since the currents have dimension 3. 
Now both jë and [j$] satisfy the same Ward identity, so 


ð, <0|Te*(x)X|0> =0. (6.6.34) 


Thus 6," =0, without use of equations of motion; any need to use the 
equations of motion would give a non-zero right-hand side to (6.6.34). In the 
absence of gauge fields, it is impossible to construct such a term. The theorem 
is thus proved. 

In the presence of gauge fields, such terms do exist. For example, in 
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quantum electrodynamics we have a counterterm to the electromagnetic 
current proportional to 0,F””, where F, is the field-strength tensor 0A, — 
0,A,. With non-abelian gauge fields one might have ¢, « ¢,,,,0*(Aj A}), 
but the presence of the ¢,.,,,, indicates a chiral symmetry, which in any case 
needs special treatment. Moreover, in a non-abelian theory, we must also 
take account of the constraints imposed by gauge invariance, which is a 
subject we will not treat until Chapter 12. 

The energy-momentum tensor also has possible counterterms, like 
(0,0, — J, 1) b? - see the references quoted above. 


6.7 Differentiation with respect to parameters in Z 


Consider Green’s function derived from the bare classical Lagrangian 
L = Z(0A)*/2 — mp A?/2 —g,A*/4! 
by using the functional integral: 


Gy = <0|TP(x,)..- P(x) | 
[oA atx,)..-Alayle 
Se (6.7.1) 
[ladles 
Differentiating with respect to gg gives 


g, iaeo. en i |asyaso je 
ae l [roage 


4 [ialen aanes H [[aay(*) fatraso} 


f [tates 
i (4) 
n a e 


oL (y) = = 


=i fa*y<orrdim). oa a 0» ]io>. (6.7.2) 


Similar formulae hold for differentiation with respect to the other parameters 
Z or m3. 
The renormalized equivalents of those equations are also useful. One use 


164 Composite operators 


will be to show, within perturbation theory, that terms quadratic in the fields 
can beshifted between free and interaction Lagrangians without affecting the 
Green’s functions. 

First consider differentiation with respect to the renormalized coupling, g. 
We differentiate the renormalized Green’s function 


Gy(Xq5-065Xy) = zrzec] (6.7.3) 


T 
Applied to each basic graph T in this formula, the differentiation just gives 
—i 
4! 
Renormalization of (6.7.4) produces a set of counterterms isomorphic to 
those in (6.7.3). So 
ô —i 
ago ae d'y <O T{Lo(y)*] — <0] [60)*]|0> } O01)... b(Xw)|0.- 
(6.7.5) 
The subtraction of the vacuum expectation value of [¢*] comes about 
because no vacuum bubbles are used in Gy. Thus each ¢* vertex in 0G, /dgis 


connected to some external line. 
Suppose we let the basic Lagrangian be 


fa O[TAO Plx). (Xn) || no counterterms- (6.7.4) 


L vasic = Z(H) /2 — m p?/2 — gh*/4! (6.7.6) 

and let the free and interaction Lagrangians be 
PL =(6¢)?/2 — m? p? /2 (6.7.7a) 
L, = (z — 1)(ĉp)/2 — (m — mi )o*/2 — 99*/4), (6.7.7b) 


with m? + m3 = m°. Notice that we have allowed the (d¢)* term to have an 
arbitrary coefficient. We choose to put some of the terms quadratic in ¢ into 
the interaction, so that we can derive an equation for 0G,/0z or 0Gy/0m? 
like (6.7.5). Then we will show we can move the quadratic terms to the Zo 
without changing the Green’s functions. 

Differentiation with respect to z or m? gives 


<-Gy=3 | dé yO] T{[(0G)?] — <O|[(2b)?]]0>} A) Hw), 


(6.7.8) 
L Gy=— [as olT Ego — (0|[A0)7 10>} 6(x,)--- (xy) 10. 
(6.7.9) 
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However, we may want to put all of (6@)?z/2 — m*7/2 into the free 
Lagrangian. In this case the free propagator is 


i/(zp* — m°). 
We wish to prove that (6.7.8) and (6.7.9) remain valid. 


In this case 0/0m? applied to an unrenormalized graph I gives a sum over 
terms in which each propagator is differentiated 


ô 1 i $ 

eae” -i( =a) (6.7.10) 
This gives us the same result for the unrenormalized graphs as the right-hand 
side of (6.7.9). Next, we differentiate a counterterm graph.C,(I/). Either a 
propagator is differentiated, so that the — iin (6.7.9) gives the basic vertex for 
[ 7(y)], or acounterterm C(y, ) is differentiated. In this second case there is 
also a counterterm graph C,(6I'/dm’) with a term dy, /0m? 

Now 


C(éy, /Om?) = 8C(y,)/am? (6.7.11) 


in the minimal subtraction scheme. (The reason is that both are defined to be 
pure poles times uto a power — the same for both graphs.) We thus obtain all 
the counterterm graph for the right-hand side of (6.7.9). 

Similarly (6.7.8) is true if z(6@)* is all in the free Lagrangian Lp. 

We thus see that, for any renormalized parameter å in the Lagrangian Z, 
we have 


0 ; d OD basic = OE aaa 
oxi [ascori] a4 o) (off EF (0) lot...) 


(6.7.12) 


From this result we can see that the Lagrangian (6.7.6)is equivalent to the 
one with unit kinetic term 


LPi =(ô¢')}?/2 — m? "2/2 —g'b'*/4! (6.7.13) 
by a scaling of the field, with 


jes Uo (6.7.14a) 
m? = zm’?, (6.7.14b) 
g=z g (6.7.14c) 


The proof is to write (6.7.6) as 
ATN z(ôġ)?/2 — m? zo? /2 — g'z7*/4!. (6.7.15) 
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Then differentiation of a renormalized Green’s function Gy of o with respect 
to z gives 
zd Gy/dz = z0Gy/02Z|tixeam.g 
F m° 6Gy/dm? loed z,9 + 2g0Gy/0g laeda 


= ifa*y<o T {[2(0o)?/2 — m? 2/2 — 294/41] 


— vacuum expectation value} $(x,)...@(xy)|0> 
=; [avo T {[ £3] — vacuum expectation value} x 


x (x)... (%y)|0>). (6.7.16) 
We now use the equation of motion (6.6.21) with A = ¢@ to give 
zdG,/dz = — NG,/2. (6.7.17) 


From this it follows that 
_ „—-N/2 
Gyle.g.m2 =Z Gyl|z> 1,g—>g',m? >m’? 
1.€., 


COLTA)... P(%n)]O> = 2°" (0|T b'(x1).-- $n) 0D, (6.7.18) 


exactly as we would expect. The proof is non-trivial only because we are 
shifting terms between the free and interaction Lagrangians. Thus we must 
ensure that counterterms do not go astray. 


6.8 Relation of renormalizations of ¢* and m? 


Observe that at order g? the renormalization factor Z, for ¢? in (6.2.13) and 
(6.2.11) is the inverse of the renormalization factor m3/m7. This relation is 


true to all orders, as we will now prove. (We are now back in ¢° theory at 
d= 6.) 
We use the renormalized formula 


mans Gy = i faty< 0| rims — vacuum expectation value} x 
m 


x &(X,)..-O(Xy)|0> 


= Time fat yO] TLA- <OlLd2I]0>}4¢e,)-—- 610). 


But we also have (6.8.1) 
L =(ôpo) /2 — m° Zmpo/2 — Job0/6, (6.8.2) 
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so that 
m?0L /dm? = — m?Z,,2/2. (6.8.3) 
Hence 
ð i 
gT En = ~5Zy 890 T (93 — <0143l0>} dte). 0> 
(6.8.4) 
Therefore 
| d*ym’[ 7] = |d ymos, (6.8.5) 


from which the desired result follows. 
Generalizations of this method can be found in Brown (1980) and Brown 
& Collins (1980). 
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Renormalization group 


As we saw in Chapter 3, the renormalization procedure has considerable 
arbitrariness: the counterterm for a graph must cancel its divergence but 
may contain any amount of finite part. A rule for choosing the value of the 
counterterm we called a renormalization prescription. In one-loop order it 
was Clear from the examples that a change in renormalization prescription 
can be cancelled by a change in the finite, renormalized couplings 
corresponding to each divergence. Thus a change in renormalization 
prescription does not change the theory but only the parametrization by 
renormalized coupling and mass. What is not so easy is to see that this 
property is true to all orders. This we will show in Section 7.1. The 
invariance of the theory under such transformations is called 
renormalization-group (RG) invariance. 

A particularly useful type of change of renormalization prescription is to 
change the renormalization mass yp. Infinitesimal changes are conveniently 
described by a differential equation, called the renormalization-group 
equation, which is derived in Section 7.3. This leads to the concept of the 
effective momentum-dependent coupling. This concept is very useful in 
Calculations of high-energy behavior, as explained in Section 7.4. The 
coefficients in the renormalization-group equation are called the 
renormalization-group coefficients and are important properties of a 
theory. Various developments of the formalism occupy the remaining 
sections. 

The renormalization group was first discussed by Stueckelberg & 
Petermann (1953) and by Gell-Mann & Low (1954). Very similar ideas are 
applied in statistical physics (Wilson & Kogut (1974)). Many important 
recent applications arise because of the asymptotic freedom of QCD. 

Results of calculations of renormalization-group coefficients can be 
found in many places: 


(1) Gross (1976) lists many one-loop results for theories with scalars and 
fermions and up to two loops for gauge theories with only fermions. 

(2) Cheng, Eichten & Li (1974) give the B-function for a general renormaliz- 
able theory to one-loop order. 
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(3) Tarasov, Vladimirov:& Zharkov (1980) compute renormalization- 
group coefficients to three-loop order in gauge theories with fermions 
using minimal subtraction. 

(4) Vladimirov, Kazakov & Tarasov (1979) compute to four-loop order in 
ot theory. 

(5) Chetyrkin, Kataev & Tkachov (1981) and Chetyrkin & Tkachov (1981) 
compute the anomalous dimension in ¢* theory at five-loop order. 

(6) Tkachov (1981) summarizes the methods used for the above 
calculations. 

(7) Caswell & Zanon (1981) perform calculations in supersymmetric 
theories at three-loop order. 


7.1 Change of renormalization prescription 


7.1.1 Change of parametrization 


The techniques we will describe are valid for any theory. However, to be 
specific, we will mainly work with the theory we have been using as a source 
of examples, the ¢° theory in six space-time dimensions. There are three 
alternative, but equivalent, forms in which to write the Lagrangian. First of 
all, we can write it in terms of the bare field ¢,: 


L =(AGo)?/2 — m3.$3/2 — goG3/6. (7.1.14) 
(As before, we ignore the term linear in ¢@.) The importance of this form is 
that the bare field ġo is invariant under change of renormalization 
prescription: its normalization is determined, because it satisfies canonical 
equal-time commutation relations. 

When we renormalize the theory, we obtain finite Green’s functions of 
the renormalized field ¢ = Z7! ġo. In terms of the renormalized field, the 
Lagrangian is 

L = Z(0$)*/2 — m3Z$?/2 — GZ"? /6 
= Z(0g)?/2 — mpo*/2 —g,¢°/6. (7.1.1) 
This is the second of the three forms. 

In the perturbative theory of renormalization, we wrote the Lagrangian 
as the sum of a free Lagrangian, a basic interaction Lagrangian, and a 
counterterm Lagrangian: 


P=Ly +P, + Pa 
= Z p H Ler: (7.1.10) 


This is the third form of the Lagrangian. Here, we have chosen to define a 
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basic Lagrangian 
L vasi = ()?/2 — m b?/2 — pw?“ gh" /6, (7.1.2) 
where mand g are the renormalized mass and coupling. Since we will mostly 
use minimal subtraction, it is sensible to define g to be dimensionless, and 
therefore to introduce the unit of mass u. The counterterm Lagrangian is 
L „= 5Z(0)"/2 — dm h*/2 — 6967/6, (7.1.3) 


and the counterterms 6Z, 6m”, and ôg are computed as definite functions of 
g, m and yp with the aid of some renormalization prescription. 
To be concrete, let us use minimal subtraction, so that 


ôZ = 9 (6—d) /c,(g,m, p), (7.1.4a) 
j=1 
ôm? = m? X (6 — d) 4b,(g,m, p), (7.1.46) 
j=1 
ôg = p°? Y (6 -d)~4a,(g, m, p). (7.1.4¢) 
j=1 


We saw, in Section 5.8, that in fact the coefficients a;, b;, c; are independent 
of m and u; they are functions of the dimensionless coupling g only. 
However we will not use this fact at the moment. 

The three forms of the Lagrangian listed in (7.1.1) are equivalent — if we 
use any of them in the functional integral, then the same Green’s functions 
will result. The coefficients Z, m2, and g, will be singular when d approaches 
6 with g,m, and p fixed. The singularities will be just such as to give finite 
Green’s functions of ¢ at d=6. 

The parametrization of the Green’s functions by g,m, and p is rather 
arbitrary. Suppose that we change variables to g’, m’, and py’, which are 
some given functions of g, m, and u. These functions may even depend on the 
regulator, d, provided that the change of variable remains non-singular at 
d = 6. Then we get the same theory, for the collection of Green’s functions 
Gy is unchanged. It is just the numerical values of the renormalized mass 
and coupling and of the unit of mass that have changed. 

We may even change the scale of the renormalized field by writing 
go’ =f, where ¢ is finite. The Green’s functions are now different: 


Go=C Gy: 
But observe that the value of ¢ is irrelevant for a physical observable like the 


S-matrix. For example, consider an S-matrix element involving N particles. 
It is obtained from Gy by (1) dividing out an external propagator G,(p,) for 
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each external line and (2) multiplying by Z'/? for each external line, where iz 
is the residue of the pole of G,(p). Finally we let the momenta p; go on-shell. 


Thus 
N 
S= lim | Gné?/TI Ga} 
P >m i=1 
= lim (GRCI [G3}. (7.1.5) 
But 
G= G (7.1.6) 


so the particle pole is at the same position in G; asin G, and the new residue 
1s 


2’ = (72. (7.1.7) 
Hence 
S = lim {GÊ N/T ] G3}. (7.1.8) 


and the S-matrix is invariant, as claimed. 


7.1.2 Renormalization-prescription dependence 


In the bare Lagrangian (7.1.1a), there are only two parameters. So there 
should be only a two-parameter collection of physical theories obtained 
from it. As we have just seen, the freedom to vary the scale of the 
renormalized field @ in the second form of Y, viz. (7.1.1b), does not 
introduce a third real parameter into the physics. 

Unfortunately, we appear to have introduced a large and indefinite 
number of parameters by having to choose one out of the infinitely many 
possible renormalization prescriptions. One might suppose that in different 
renormalization prescriptions, the singular behavior of m, and g, as d— 6 
could be different in such a way that one picks up different phases of the 
theory. In fact, this is not so. We will show that a change of renormalization 
prescription is one of the reparametrizations discussed in the previous 
subsection 7.1.1. This is the property we have defined as renormalization- 
group invariance. 

Even within a single renormalization prescription, we have introduced a 
third parameter, the unit of mass, u. Notice that the basic Lagrangian does 
not depend on pand g separately, but only on the combination p> ~ 4/*g; but 
notice also that this property is not true for, say, the renormalized Green’s 
functions at one-loop order. However, a change of u is in effect a change in 
renormalization prescription. Indeed, we could include in our definition of 
the renormalization prescription the requirement that u and m have a fixed 
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ratio; we would still have two free parameters. A change in this ratio is then 
a change in renormalization prescription. Our proof of renormalization- 
group invariance will, in fact, only explicitly cover the case of a change in p. 
The more general case will be essentially the same. 

We will prove that if we change u to p’, then the physics is unchanged, 
provided that we choose suitable new values, g’ and m’, for the renormalized 
coupling and mass. That is, an S-matrix element S(g,m, u) satisfies 


S(g, m, p) P: Sg’, m, p’). 
The bare mass and coupling m, and g, are similarly invariant. 
The new renormalized field ¢’ with the new values (g’,m’, u’) of the 
parameters is not the same as with the old values but is related by 


¢ = ¢¢, 
where ¢ is a finite function of g, m, pand py’. Hence the renormalized Green’s 
functions satisfy 
Gy(P1>- ee >Pnig,M, u) = Co GAD so. Puig o m’, u’) 
=; NG. (7.1.9) 
It will be convenient, in our proof, to compute Green’s functions of the 


original field ¢, but with the new value p’ of the unit of mass. Now, in terms 
of the new field ¢’, the Lagrangian is 


L =Z'0g'?/2 — mè o'7/2 — 939'?/6. (7.1.10) 
Here we write 
Z' =2Z(g’,m'/p', d), (7.1.11a) 
my = m}(g',m', u’,d), (7.1.11) 
95 = JBlg', p, d). (7.1.11¢) 


These are the same functions of the new renormalized parameters m’, g’, and 
u’ as the original bare parameters were of the old renormalized parameters 
m, g, and u. To get the Green’s functions of the original field ¢ but with the 
new value of the unit of mass, we substitute {ø for ¢' to obtain 


L=Z(g',m'/p',d\l*eg*/2 — malg’, m'/u', a)c*g?/2 
— gp(g',m', p', a)" p*/6. (7.1.12) 


7.1.3 Low-order examples 


Let us remind ourselves how the changes in g and m are obtained at one- 
loop order. We must start at tree approximation, where we ignore all 
counterterms. In order that the two formulae (7.1.1b) and (7.1.12) for Z be 
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the same in tree approximation when we change u to yp’, we must write 
9’ = Inew = (u/h')? g, 
m =m, 
¢ = 1. (All tree approximation.) (7.1.13) 
Note that at d = 6 we have g’ =g to lowest order. We distinguish g’ and 


Janew: Inew 18 defined to be exactly the value of g’ in tree approximation. In 
higher-order calculations, g’ gets corrections, but g e, will be defined to be 


(u/p')? 4g always. 
To treat higher-order corrections, we write the Lagrangian (7.1.12) as 
L = (84)? /2 — mg?/2 — p> g(u/p')?-“?]6°/6 
+ 6'Z(dp)?/2 — 6'm*?/2 — 6967/6. (7.1.14) 
Our strategy will be to express the counterterms in (7.1.14) as minimal 
subtraction counterterms plus some new finite pieces. The finite pieces will 
accomplish the change of parametrization. 
First we consider the one-loop graph for the self-energy, Fig. 3.1.1. The 
unrenormalized value is 
_ ig°T(2 — d/2) f 
‘1287? J, 
This is invariant under the transformation (4, m, g)—(u’, m, 
g(u/u')?~ 77). If we use unit of mass u, then the counterterm is 


C(T,)= — pole (T) 


dx [m? — p?x(1 — x)]”2~2(4np2)3-42. (7.1.15) 


bs ig? 


= 64nd —6) -ø (m — <p”). (7.1.16) 


Next we use pw’ and g ey =(u/y’)? “/*g instead of u and g. The 
counterterm changes to 


C(r ) a = 19? ow (m? poi ip?) 
1 — 64n?(d — 6) = 
— ig? ee a ae 
= —= — ; 7.1.17 


Notice that we define C'(T,) to be the negative of the pole part of T, 
with the d-dependence of g ew ignored. That is, we consider the function 
TY, =r (P, Jaw mM, pb’) and extract its pole at d=6 with g,.,, fixed. This 
prescription ensures that we may later replace g ew by its value at d=6 
without changing the renormalized value of the graph. 

Since both C(I,) and C'(T,) cancel the divergence, their difference is 
finite. We may therefore obtain the same value for the graph plus 
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counterterm by letting the counterterm coefficients in (7.1.14), namely 6’Z 
and ô'm?, each be a sum of two terms: 


2 2 , 6-d 
17 Inew Inew (u /u) =] 
2 = shea aal ee) Vow 
mg: m*dnew| (u‘/u)° 4 — 1 
ay an new new 
Om = Gani(d—6) 6n | d—6 | ve 


The first term in each equation is the minimal subtraction counterterm for 
the new coupling, while the second term is finite as d > 6. Using the formula 
(7.1.12) for the Lagrangian, we may regroup these terms to give 


2 11, \6—d 
7 Inew (u'/u) =l 
e -1+ 16 l 
2 Fi-u) 
EE r H] (7.1.20) 
22 Es '\6—d 
m'2C? =m + ae | (7.1.21) 


Hence we obtain the value of m’: 


2 oS \6—d 
m’? = “ft 5g [ER |+ 00%} 


384r d—6 
5g? , 4 
—+m?| 1+ =—— In (u/u') + O(g*) | asd-6. (7.1.22) 
384r 
Also at d = 6 we have 
2 
g 
d = 6) = 1 + ——_~ s +, 7.1. 
Cd = 6) =1 + 5 in(u/u') + O(g) (7.1.23) 


We can apply the same procedure to the vertex graph, Fig. 3.6.1, whose 
value is the factor in curly brackets in (5.3.5). The counterterm 6’g is written 
as 


3 ,.13-d/2 3 ,3-d/2 1), \6—d _ 
A eee caine C ae | (7.1.24) 
64n3(d—6) | 64n d—6 


with the result that 
2 , 6-—d 
: _ g (u/n)? t1 
37/,,/3—4/2 — 13-—d/2 1 new 
£ gH GnewH a sa| d—6 


g? [1 u/u) À 
= gu? ai re SHEN (7.1.25) 
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It follows that 


eae 3 2 Liss ‘\6—d P 
g' = gluu" fiaa T | og j 


3g? 
a| + 55g in (u/u') + Og | as d—>6. ( ) 


Our strategy for understanding the effect of a change in renormalization 
prescription is to absorb the difference into a finite counterterm. The 
counterterm will itself generate divergent counterterms when we insert it 
into a bigger graph. Finally we reorganize the Lagrangian by putting all the 
finite counterterms into the basic Lagrangian. Then we see that the change 
in renormalization prescription is exactly compensated by a change in the 
parameters of the theory. 

What happens when we go to higher order? An example is given by the 
two-loop self-energy graph of Fig. 7.1.1(a). Graphs (b), (c), and (d) renormal- 
ize its subdivergences and its overall divergence. We write the renormalized 
graph with the original value of u as 


X,(p, g, m, u) = 2, T 2X, = 2 (7.1.27) 


(We used the fact that X, = £Ł..) The unrenormalized graph is unchanged if 
we replace u by p’ and g by g ew. But the vertex counterterm is treated 
exactly as at (7.1.24), so that: 


Ł,(Pp, g, m, u) = Ł,(P, Jnew» m, u’) + [X,(p, g, m, H) rae Lp(P, Inew? m, p')]. 
(7.1.28) 


The first term contains the counterterm for one of the subgraphs, computed 
by minimal subtraction with unit of mass p’. The second term exactly 
compensates the difference. It has the counterterm replaced by the finite 


(a) (b) 


—— 


(d) 
(c) 


Fig. 7.1.1. Self-energy graph with counterterm graphs. 


176 Renormalization group 


part in (7.1.24), viz. 
3 13-—d/2 , 6-d 
Jnew =l 
a a] mae 


We now write 


È, = LA(Ps news M, H’) + 224(P; Inews M, H’) + LalP; news M, H’) 


A 
+ far, ae T 24(P, g,m, u) a X4(P, Inew? M, w} (7.1.30) 


Here we wrote out the first three terms as the minimal renormalization of 
x, with unit of mass py’. The remainder is finite, since £, is finite. The term 
2r ,Ag/(Gnewlt’? 7) is the one-loop self-energy graph with one of its 
couplings replaced by Ag. It has a divergence which can be cancelled by a 
minimal counterterm: 


= 1GnewAg 2 1.2 
= Cando” oP). 7.1.31 
amado" TEP) (7.1.31) 
Hence the term in curly brackets is 
2r ,Ag 
Pera + zc | + [24(p,9,m, u) — ZalP, Janew M, U’) — 2C]. 


(7.1.32) 


The second term is finite, since there are no remaining divergences. It is of 
the form 


i( — Am? + AZp°), 


and so gives rise to another finite contribution to m’? and to ¢. 


7.2 Proof of RG invariance 


To show to all orders of perturbation theory that g’, m’ and ¢ can be chosen 
so that (7.1.9) holds, we generalize from our treatment of the examples. We 
write 
Cone = gp 4/2 + Ag, 
(7 =1+ AC?, 
gm? = m + Am’. (7.2.1) 
The original version of the theory has counterterms computed with unit of 
mass p: 
P= Lrict+ Lal). (7.2.2) 


We now change to unit of mass yp’ and wish to show that identically the 
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same Green’s functions and total Lagrangian are obtained if we make 
changes of the form (7.2.1). 
The Lagrangian is written in the form 


ff = A ae + Lo + Zs (7.2.3) 
with the basic Lagrangian the same as before, but written as 
L vasic = (06)?/2 — m*h?/2 — GnewH’ >“ */6. (7.2.4) 


(As before, we define g_.. = (u/u') “g.) The term Y is a ‘compensating 
Lagrangian’ of the form 
L = AlC70g?/2 — Am? 7/2 — Agd?/6. (7.2.5) 

The counterterm Lagrangian #2, in (7.2.3) is computed using minimal 
subtraction with unit of mass p’. 

We may later reorganize (7.2.3) so that the basic Lagrangian is taken as 
F asic + Z- We may drop the d-dependence of the finite counterterms Ag, 
AC?, and Am, since renormalized Green’s functions are finite functions of 
renormalized quantities. Finally we may rescale the fields to give (7.1.10). 

But a proof is most easily given with the form (7.2.3). Each of the finite 
counterterms is computed as a sum of terms, one for each 1PI graph 
contributing to the relevant Green’s function: 


Ag = Y A,(g), ete. (7.2.6) 


Particular cases are given by the examples in Section 7.1. Thus 


2 , 6-d _ 
sate da [eta] 


384r? d—6 
A m? te mg? ow (u'/)°~4 =l 
ae 64r? d—6 
3 , 6-d 
13 — Inew (u /p) — 1 


A,. aC p a m°) cae ais i[Z,(p, g.m, p) = X4(P, Inew? M, p’) 7 2C], 
where the label on A indicates the number of the figure depicting the basic 
graph. 

The general proof is by induction on the size of a graph. Consider a 
graph G contributing to some Green’s function. Using the basic interaction 
— gp? —*/* 63/6 we renormalize it with unit of mass u using the method of 
Section 5.11. The renormalized value of G is then 

R(G)=G+ f CG). (7.2.7) 


yoG 
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Here the sum is over subgraphs of G that consist of one or more disjoint 1PI 
graphs, and C,(G) denotes the replacement of each of these 1PI graphs by a 
counterterm with unit of mass u. Similarly we can renormalize G with a 
different unit of mass yp’ but with the same basic interaction 
— GInewtt’>~“'*h*/6 to get 
R(G)=G+ f CG). (7.2.8) 
ySG 
Here, we use the prime to denote use of the unit of mass py’ instead of u. 
We will now derive a series of new basic graphs containing finite 
counterterms. These counterterms will be used to generate the compensat- 
ing Lagrangian Z, in (7.2.3). The new graphs will need renormalization, 
and we will arrange them so that when they are added to R'(G), we get back 
the original value R(G). The aim will be to have a finite counterterm A, for 
every 1PI graph y that is a vertex or self-energy graph. We will arrange the 
A,’s so that 


R(G) = r'| o +) A(G)+ ao | (7.2.9) 


y¥¢G 


Here the sum over y is over products of 1 PI graphs. The overall counterterm 
for a graph is computed using minimal subtraction, but with the d- 
dependence of g,.,, and of the finite counterterms A, ignored (see our 
remarks below (7.1.17)). That is, the counterterms in R’ are a series of poles 
at d = 6 with their coefficrents being power series in g,,,,. In the case of a 
vertex'subgraph, there is also the usual factor u’? ~ “*. The finite subtraction 
A(G) for the complete graph in (7.2.9) is only non-zero if G is a 1 PI vertex or 
a self-energy graph. 

We will prove the following relation between counterterms for a 1PI 
graph 


Ciy)=C f F2 440) + A(y). (7.2.10) 
ogy 


The above equations (7.2.9) and (7.2.10) are trivially true for tree graphs, 
where no counterterms are needed, for we can set A(tree graph) = 0. So let 
us assume they are true for all graphs smaller than a given graph G, with all 
the A,’s finite. There are two cases: (a) G is not overall divergent; then we 
must prove (7.2.9) with no counterterm A(G). (b) G is 1PI and overall 
divergent; then we use (7.2.10) with y replaced by G to define A(G). We must 
prove A(G) finite and prove (7.2.9). We must also prove A(G) is polynomial 
in the external momenta of G, with degree equal to the degree of divergence 
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of G; we will assume inductively that this is true with G replaced by any 
smaller graph. 

Consider the terms in (7.2.7). For each we identify a contribution in 
(7.2.9). The term G is the same as G in R'(G). Next let y be a 1PI subgraph of 
G (G itself being excluded). Decompose C,(G) by (7.2.10): 


(1) The C’(y) term occurs in RG) as C; (G). 

(2) The term A(y) occurs as A,(G). 

(3) The term C°(A,(y)) occurs as a counterterm in the renormalization 
R‘(A,(G)). | 


If G is not overall divergent, these exhaust all of the terms in R(G) and on 
the right-hand side of (7.2.9). But if Gis 1PI and overall divergent then there 
remains C(G) in (7.2.7) and the terms 


c| 6 +y aG) | + A(G) 
ÒG 


in (7.2.9). We are therefore forced to define A(G) by (7.2.10), and it remains to 
prove A(G) finite. This is now easy, since 


R(G), R'(G), and r'| y s0) | (7.2.11) 
gG 


are all finite, while we have 
R'(A(G)) = A(G). 

Moreover all the terms in (7.2.10), except possibly A(G), are ordinary 
minimal-subtraction counterterms. So they are polynomial in the external 
momenta of G, with degree equal to the degree of divergence of G. So A(G) is 
polynomial, of the same degree. (Note that the replacement of a subgraph y 
by A, does not change the overall degree of divergence of any graph T 
satisfying y © I € G. This is because of our inductive assumption on the 
polynomial degree of A,.) 

The theorem is also true if R and R’ stand not for renormalization with 
different units of mass, but for any two renormalization prescriptions. It is 
important that the d-dependence of A, is taken outside of the extraction of 
pole parts when computing a counterterm like C,(A,). This can be seen 
from the example of Fig. 7.1.1, at (7.1.31). The primed counterterms must be 
a particular function of g aew, u’, m’, and d. The fact that g’ itself is a function 
of other variables is ignored. 

Equation (7.2.10) expresses the counterterm C(y) for a graph y (with unit 
of mass u) in terms of renormalization counterterms with unit of mass p’ 
and a finite counterterm A(y). We can use the finite counterterms to generate 
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the compensating Lagrangian Z, in (7.2.3), and the primed counterterms to 
generate Zi. It should be evident that the new Lagrangian is the same as 
the original one (with unit of mass py), considered as a function of ¢ and d¢. 
If we set 6, = Z'/*¢, with Z = Z(g, m, u), then we can deduce that the bare 
parameters m, and g, are renormalization-group invariant: 


m(g, m, H) = m(g’, m’, u’), 
golg, M, u) = go(g’,m’, p’). (7.2.12) 


7.3 Renormalization-group equation 


We saw in Sections 7.1 and 7.2 that a change in the unit of mass pn 
accompanied by suitable changes in coupling and mass does not change the 
theory, while the Green’s functions satisfy 

Gies hy GIO SO Ga Kose g mn). (7.3.1) 
We wish to compute g’, m’? and ¢ as functions of g, m°, u, and jy’. If the ratio 
u/p' is large, it is not sufficient to use lowest-order perturbation theory, 
since, for example, in (7.1.26) the coefficient of g? may be large. 

An important device is to consider a large change in u as being made up 
of a sequence of very small changes, so that g’, m’*, and ¢ are obtained as 
solutions of differential equations. This is the subject to which we now turn. 

The consequence of our work in Sections 7.1 and 7.2 is that for a given 
physical theory, we have for each value of u a definite value of the coupling 
g(u) and mass m(u). These are called the effective (or running) coupling and | 
mass. We will derive differential equations for g(u) and m(). 

The easiest way to derive the results is to look at the Green’s functions 
and the Lagrangian expressed in terms of the bare field ¢,. The important 
point is that the Green’s functions of ¢, are invariant under our change of 
parametrization (y,g,m)—(y’,g’,m’). (This is because the mass and coup- 
ling in the bare Lagrangian are invariant.) 


7.3.1 Renormalization-group coefficients 


Physical quantities like the S-matrix are invariant under the change of 
variable (u, g(u), m(u)) > (u’, g(u’), m(u’)). This invariance is conveniently 
expressed by considering a small change in u, accompanied by the 
corresponding changes in g and m. We write the result as 
udS/du = 0. (7.3.2) 
The total derivative with respect to u can be written as 
ud/du = uð/ôu + Bd/dg — y,,m?0/0m?. (7.3.3) 
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On the right-hand side the partial derivatives with respect to u, g or m are 
taken with the other two fixed, and the coefficients B and Ym give the 
variations of g(u) and m?() when jis varied. The sign of the m2d/dm? term is 
the usual convention (Weinberg (1973)). We have 


B = udg(u)/dy, (7.3.4) 
Ym = —m udm? (u)/dp. (7.3.4b) 


The coefficients B and y,, are called renormalization-group coefficients. 
As we will see they are easy to calculate in terms of the counterterms, as 
functions of g, m, u, and d. If we use minimal subtraction, they have no mass 
dependence. This means that (7.3.4) can be readily solved as differential 
equations for g(u) and m(u). Indeed this is the easiest way in practice to 
compute the effective coupling and mass. 

To compute f and y„ it is convenient to consider the Lagrangian 
expressed in terms of the bare field @, — see (7.1.1a). We saw in Section 7.2 
that m and g, are renormalization-group invariant: 


udgo/du = 0, (7.3.5) 
pdmé/du =0. 
Suppose we have computed g, and m, to some order in g. Then (7.3.5) can 
be solved to give B and y- 
An example of such a calculation comes from our results on °? theory, 
where 
2 


3 
Jo = ptg 1 pee ota") | 


256r°(d — 6) 
2 2 5g” 4 
Thus 
= dgo — ,,3-d/2 5 
ear a fas- d/2) — 2 + 0095) 
9g? i 
T a + 256n'(d —6) + O(g |}. 
SO 
B(g) = (d/2 — 3)g — =i =z- + 0(g°) 
3g° 5 
-56n + O(g°) atd=6. (7.3.7) 
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Similarly 
dm? 
0 = u— 
ET 
= — mya [1 + 0(g?)] + m2p(g)| ——2— + 019°) 
" 1927°(d — 6) 
so that 
5 g? J 
Im = E (7.3.8) 


Observe that the (d/2 — 3)g term in f is important in these derivations, even 
though the term disappears at d = 6. Observe also that, even though the 
coefficients in gọ and mọ diverge at d = 6, it is crucial to expand strictly in 
powers of g. A phenomenon true to all orders is that B and y„ are 
independent of m and yp, provided that we renormalize by minimal 
subtraction. 

The general calculation of B and y,, can be organized with the aid of (7.1.4) 
for the counterterms. Since we use minimal subtraction, the m- and u- 
dependence of ihe bare parameters is simple: 

Jo = K?" Go(G, 4), 

me = m° Z„(g,d). (7.3.9) 
Then 
ÔJo 


B(g, d) = (d/2 — | Be, 


ôln Z 
Ym(9) = BU, are (7.3.10) 
g 
The expressions (7.3.9) are to be expanded in powers of g with the aid of 
(7.1.4). Now 


Jo = (Wg + dg)Z~*? 


— 3 
spa g 4 0 39¢ (9) schipher poles | 
6-—d 
Z, = (1+ 6m*/m?)Z~! 


=|: +) + higher poles | (7.3.11) 


where we have picked out the single poles in the series expansion of the 
counterterms (7.1.4). (These are all that will be relevant.) Then (7.3.10) 
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becomes 
— 
+ alg) — 29°19) 29¢1(9) + higher poles 
d 6—d 
a Oa a‘ (g) — gc; (g) — 3c,(9) 
[ + B e cera + higher poles | 


= (d/2 — 3)g + (i — 7 [$g9c,(g) — a,(g) ] + poles (that cancel) 


= (d/2 — 3)g + Bg), (7.3.12a) 


b,(g) — ¢,(g) 
6—d 


bs 0 , 
Ym(g) = [(d/2 — 3)g+ B(g)] al + higher poles | 


0 
= 29 Lex(9) — b,(g)]. (7.3.12) 


These manipulations are made by expanding in powers of g. 

In the last line of each of (7.3.12) we have used the fact that although pole 
terms are in principle present, they must cancel in order that B and y,, be 
finite as d > 6. Notice that $ and y,, are independent of m and y, and that the 
only d-dependence is the (d/2 — 3)g term in $. Only the single-pole terms are 
needed for the calculation. There is a series of relations between these and 
the higher poles that ensures that the poles cancel in (7.3.12); we will 
investigate these later. 


7.3.2 RG equation 


The RG coefficients B and y, are computed from two out of three 
combinations of the counterterms. The differential equat‘ons (7.3.4) then 
enable g’ and m’ in (7.3.1) to be computed. To complete the calculation we 
need ¢. This is related to the wave-function renormalization. It is easiest to 
obtain by observing that the bare Green’s functions G® = Z’? Gy are 
renormalization-group invariant, so that 


d d z 
u—Gy= Hg Cv ~12) 


du 
N d 
= — —u— (in Z)G,. 3. 
aS )Gy (7.3.13) 
Let us define the finite coefficient 
In Z: 
Y= du nZ; (7.3.14) 
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then Gy satisfies the following renormalization group equation (e.g. 
Weinberg (1973)): 


ogre Gy = aay ae m? i Pe Gy=0. (7.3.15) 
Ju PLA ui ER ag a dae Te GANS 2. 
In the minimal subtraction scheme for ¢° we find 


y= [0/2 30+ BO Jgs 9 + higher poles | 


l d 
= ~ 29. G5 619) 


g? 
4 7.3.1 
= se + O(g) (7.3.16) 
7.3.3 Solution 
We wish to solve the RG equations to find g(u'), m(u') and ¢(p', u) in (7.3.1), 
given that g(u) =g and m(u) =m. The RG equation tells us that 
1 d 
r = ye di — In [Gy(u')/Gy(u) J 


= — 77[9(u')]. (7.3.17) 


So we must solve this equation together with (7.3.4) for g and m. The 
boundary conditions are 


glu) = 9, 
m(u) = 
C(u, u) = 1. (7.3.18) 
Explicit solutions can be written: 
gin’) dg’ 


mind | aa 


m?(u') = m?(u) exp -fS = SE yaa} 


= 2 dg’ | 

= mex} — Jo wo BG) S 
udu 

cu) = exp} ~ T- F T 


g(u') yg’) 
= dg 7.3. 
api- f a ads oe 
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Approximations can be made by taking a finite number of terms in the 
perturbation series for p, y,, and y. For example: 


256r? (9%) dg 
In(u'/uy) = ——3 | ~ [1 + O(g°)] 
g(u) g 


128r°[ 1 1 glu’) 
aT -7 p o| in( 4) | (7.3.20) 


This is accurate if g(yu’) and g(u) are small. Notice that g(y’)—0 as 
u’ — œ. This is the property called asymptotic freedom. It is determined by 
the negative sign of the first term in f at d=6. 

The full solution to the RG equation is 


Adu 
Gy(x;9,m, p) = exp] -5 | Fata) |Gsesiaw mu’), u’). (7.3.21) 


u 


7.4 Large-momentum behavior of Green’s functions 


The most important application of the renormalization group is to 
compute large-momentum behavior. In this section we treat the simplest 
case, that of a Green’s function Gy,(p,,...,py) all of whose external 
momenta are made large. (Notice that we have used our standard notation, 
where the tilde indicates Fourier transformation into momentum space.) 

Let us suppose initially that all the Lorentz invariants formed from 
the momenta are non-zero. Then we scale all the momenta by a factor x: 
pi> kp;, and let x get large. Thus all the Lorentz invariants p;p, are scaled 
by a factor k? and become large. Under these conditions, Weinberg’s 
theorem tells us that at least in a renormalizable theory all internal lines of 
graphs for Gy carry large momenta, and that graphs for Gy have the 
asymptotic behavior 


x © (logarithms of x). (7.4.1) 


Since all propagator denominators are large, we should be able to neglect 
masses and make only an error a power of x smaller than the leading 
behavior (7.4.1). 

For example consider the propagator in ¢° theory at d=6. The tree 
graph goes like i/p” at large p?, while the one-loop correction is 


i g? f, l — p?x(1 — x) 
p 12823 lio —1)+ {. dx x(1 — x) in| a 


+ o| Tyin (=) |} (7.4.2) 


(We used (3.6.10) for the self-energy graph to derive this equation.) 
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Now there is a term proportional to In(— p?/y7)/p? that gets large 
relative to the tree graph if p? is large enough. Thus the perturbation series 
has large coefficients and is not directly useful. However we may use the 
renormalization group to set u? = O(p”). This makes the coefficient small 
again. So we use the following strategy to compute G,(x«p) at large x: 


(1) Set yw’ = kp, and use the solution of the RG equation to write 
Gy(Kp,g,m, u) = Cp, u) “Gy(«p, g(Kp), m(KL), Kp). (7.4.3) 


(2) Neglect m (if m(ku) does not get too large). Then use dimensional 
analysis to give 


Gulp. go, u) ~ 6 "Gy(Kp, y(cy), 0, Ku) 
= Kain (acu, u) “Gy(p,g(ku),0, 4). (7.4.4) 
(3) Large x-dependent coefficients, as in (7.4.2), are now removed, so if g(Ky) 
is small, a low-order calculation suffices. 


This procedure makes it evident that the coupling that is relevant is the 
effective coupling at the scale of the momenta involved. 

It should be noted that we have related the large-momentum behavior of 
G, to the finite-momentum behavior of the zero-mass theory. It is therefore 
crucial that the zero-mass limit exists. However this limit does not always 
exist: if we use mass-shell or zero-momentum subtractions, then we see 
from, for example, (3.4.7), that the same self-energy as considered in (7.4.2) 
diverges as m0. Now Weinberg’s theorem tells us that, in the dominant 
momentum region for a graph without counterterms, all lines are far off- 
shell; hence masses can be neglected. So the problem must be that with 
mass-shell or zero-momentum renormalization prescriptions, the counter- 
terms diverge as m— 0. This is easily checked from our explicit calculations 
(see (5.10.2)). 

We can now see the practical importance of the theorem whose proof was 
summarized at the end of Section 5.8, that the counterterms may be chosen 
polynomial in masses. It ensures that the zero-mass limit may be taken 
directly and used to compute large momentum behavior. The minimal 
subtraction scheme is one way of ensuring that counterterms are poly- 
nomial in mass. 

If one uses, say, zero-momentum subtractions, large-momentum be- 
havior may be computed by changing renormalization prescription to, say, 
minimal subtraction. Another approach is to observe that the logarithms of 
p° break a possible symmetry of the theory under scaling transformations. 
The consequences of this point of view were worked out by Callan (1970) 
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and Symanzik (1970b). They derived the Ward identity for scaling 
transformations. It is called the Callan—Symanzik equation and looks 
similar to the RG equation. This equation may also be used to discuss high- 
energy behavior. 


7.4.1 Generalizations 


The behavior of Gy when all momenta are scaled by a large factor x is not 
normally experimentally relevant, for all the external momenta are then far 
off-shell. In coordinate space the corresponding region is the short-distance 
limit of Gy(x,,...xy), where every x;— x, is made small: x,—x,- 
(x j X,)/K. 

This means that we should be able to use RG methods to discuss the 
renormalization of the theory, for renormalization is a purely short- 
distance phenomenon. We will work out the details in Section 7.10. On the 
other hand, physical experiments involve long distances. To get results for 
high-energy experiments we need the so-called factorization theorems. The 
simplest of these is the operator-product expansion which we will treat in 
Chapter 10. These theorems typically give a cross-section as a product ofa 
factor which can be computed by pure short-distance methods and of 
simple factors related to wave-functions of the incoming and/or outgoing 
particles. 

We could also use RG methods to discuss the infra-red limit x — 0. This is 
only useful if masses can be neglected. Certainly this is true in a purely 
massless theory, as we will see in Section 7.5.4, and then IR behavior is 
computable if and only if the theory is not asymptotically free. But in a 
massive theory, it is not useful to take much less than a typical mass, for 
one obtains logarithms of m/y, and these prevent a simple use of 
perturbative methods when u <& m. 


7.5 Varieties of high- and low-energy behavior 


7.5.1 Asymptotic freedom 


In solving the RG equation to obtain high-energy behavior, we find two 
cases according to whether f is positive or negative. In this section we 
discuss the asymptotically free case, when B is negative. Suppose the 
effective coupling g(u) is small for one value of u, so that B is well 
approximated by its first term. Then the evolution equation (7.3.4a) shows 
that g becomes even smaller at larger values of u and in fact goes to zero as 
u— oo. Thus perturbation theory is reliable for computing high-energy 
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behavior. From (7.3.7) we see that ø? in six dimensions is asymptotically 
free. 

It is instructive to compute the behavior of g from the first two terms in £. 
Let us define 


B=-—A,g? —A,g°+ Olg’). (7.5.1) 


We have the equation for the evolution of the effective coupling: 


d 
n= = Bio). (7.5.2) 
u 


Equations of the same form as (7.5.1) and (7.5.2) hold in any renormalizable 
theory, for example in QCD, though 4, and A, are not necessarily positive 
in the general case. 

The solution of (7.5.2) is 


glu) 
In u = constant + | dg’/B(g’) 


giu) 1 A, 0 
= constant + d + 
g | : las go ar j | 


= constant + —_——_} ve T AT +73 A, an [g(u)] + O(g?). (7.5.3) 


The constant can be computed from a oar of g(u) at one value of u. It 
is conventional (Buras, Floratos, Ross & Sachrajda (1977)) to write the 
constant in the form In A + 4.4,Aj; *In(A,), where A is a parameter with the 
dimensions of mass. Then 


In(u2/A?) = r” [A ,.9(u)7] + Olg’), (7.5.4) 


Agu 
so that 
1 A, In [In(u?/A7) | In?[In(y/A) ] 
A= TES STGP Ff a S OS 


A specification of g(u) at one value of u is exactly equivalent to a 
specification of A. The precise choice of the scale of A is that in (7.5.5) the 
omitted terms are of the order shown rather than of order 1/In?(u/A). The 
expansion (7.5.5) is much used in QCD. A higher-order calculation of g can 
be made from the following form of the solution 


1 A, 
In(u?/A?) = 75 rg tgr Mg] 


glu) 1 e A 
2 d ee 7.5.6 
j p ý | gant Ze aria a) 
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It is necessary to go to two-loop order to obtain both the terms on the right 
of (7.5.6) that diverge as g — 0. 
The values of m(u) and ¢ may be similarly calculated. For example, if 
y= Cig + (7.5.7) 
then 
’dg C, 


Cu’, =expf S oa. — + O(g "| 
= exp it In = glp) + ol) | 


g(u) 
1/24; 
= e] [1 + 0(9’)] 
oc [In(u’/p)]~%/441, as p' >. (7.5.8) 


-7.5.2 Maximum accuracy in an asymptotically free theory 


The results above enable calculations of Green’s functions to be made at 
high energy. By taking more and more terms in the series, we may improve 
the predictions. However, in general, perturbation series are asymptotic, 
not convergent. A trivial example is the ordinary integral 


oc 


I(g,m) = m(2n)~ a dzexp(— m?z?/2 — gz*/4!) 


This can be considered to be a functional integral in Euclidean ¢* field 
theory at zero space-time dimension with the normalization chosen to give 
I = 1 when g =0. The perturbation expansion is 


I ~ m(2n)7*? M dzexp(—m am L) a 


N= 
-$ aee 1/2) 


Noo \6m* F(N + 1) 
= (5) i. (7.5.9) 


Now 
LANN 
TES (mN) + [1 +0(1/N)], as N—=>œæ, (7.5.10) 


so the series is divergent. The divergence is associated with the fact that the 
defining integral diverges if g is negative, so that J is not analytic at g = 0. 
The corresponding property of ¢* theory is that the Hamiltonian is 
unbounded below (i.e., the vacuum does not exist) if g is negative. 
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What we can say is that we can approximate I by truncating the series: 


Nm 
I= ¥ (g/m Ip + O[G/m*)***]. (7.5.11) 
N=0 
The error is estimated by the first term omitted, i.e., 


OUy, + (g/m*)%m* 4], 
These results are standard in the theory of asymptotic expansions for simple 
integrals. All experience, together with rigorous theorems for quantum 
mechanics and super-renormalizable field theories (Glimm & Jaffe (1981) 
and references therein), indicate that this behavior is typical for functional 
integrals in a non-trivial dimension (1.e., d > 0). 
Next, let us suppose we wish to compute some quantity in an 
asymptotically free theory by truncating its perturbation expansion 


N m 
2 glu)" Iy. 
N=0 


We assume that the quantity depends on some momentum p, and that we 
set the unit of mass u to be of order p. A case would be the propagator with 
u = O(|p|). Suppose that the coefficients in the expansion behave like 


Iy ~ NNbN N@d[1 + O(1/N)], (7.5.12) 
for large N. What is the best accuracy with which we can calculate the 
quantity ? This is given by the minimum error, i.e., the minimum of I yg?" as 
N varies. The result is that the minimum possible error in a perturbative 
calculation is of order 


constant | p|~ 741/*(In|p|)*. 


This means that beyond a certain level, power-law corrections to the 
asymptotic behavior computed in perturbation theory are meaningless 
since they are smaller than the irreducible error in using perturbation 
theory. Power-law corrections are those that are a power of p? smaller than 
the leading term. 


7.5.3 Fixed point theories 


In four dimensions, the only theories that are asymptotically free are non- 
abelian gauge theories with a small enough number of matter fields — see 
Coleman & Gross (1973) and Gross (1976). Other theories, like ¢* and 
QED, have an effective coupling that increases with energy. Thus, in such 
theories it is impossible to compute the true high-energy behavior by 
perturbation theory. (Note however that the coupling in QED is a/z 
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B(g) 


g* 
Fig. 7.5.1. B(g) in a non-asymptotically free theory with a fixed point at g = g*. 


~ 1/430. This is very small, so the non-perturbative region in QED does 
not occur until very many orders of magnitude beyond experimentally 
accessible energies.) 

An interesting possibility is that B(g) has the form shown in Fig. 7.5.1, 
with a zero at g = g*. Then g(u) approaches the ‘fixed point’ g* as y> œ. At 
large momentum Green’s functions behave like 


Gy(KP1,-++5KDys 9, M, p) ~ const K4imen+ N92 G (p,,..., Py, g*, 0, u). 
(7.5.13) 


This behavior is as if @ had an extra term y(g*)/2 in its dimension. 
Consequently, the function y(g)/2 is called the anomalous dimension of the 
field œ. 


7.5.4 Low-energy behavior of massless theory 


If m = 0, then the renormalization group can be used to compute infra-red 
behavior. The calculability is the opposite of that for the UV behavior. 
Consider first an asymptotically free theory. There, the effective coupling 
g(u) goes to zero when p goes to infinity, so that short-distance behavior is 
computable perturbatively, as we saw in Section 7.5.1. But, when pis small, 
g(u) is large, so the infra-red behavior cannot be computed reliably by 
perturbation theory. (This is the case for strong interactions, according to 
QCD.) 

Let us now consider a non-asymptotically free theory. For large u, the 
effective coupling is large, so the short-distance behavior is not per- 
turbatively computable. (For example, a perturbative calculation in low 
order of the position of the fixed point, g*, in Fig. 7.5.1 and of the value of 
B(g*) is subject to large errors from higher-order uncalculated corrections.) 
But when u goes to zero, so does the effective coupling. We are assuming the 
absence of a mass term for the field, so there are no large logarithms of m/y 
as u goes to zero. Hence, we can compute IR behavior in such a theory, just 
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as we computed UV behavior in an asymptotically free theory. We will now 
do this. 

Now, for almost any graph there are large logarithms of p?/u? as p? > 0 
in a massless theory, just as in the ultra-violet. In the case of the propagator, 
these logarithms mean that the propagator’s singularity is not a pole, at 
least order-by-order. | | 

To investigate this singularity let us again write 


yg)=C,9°, B@)=— A419”, (7.5.14) 


using the same notation as before, but now with A, < 0. We assume that g is 
below the first non-zero fixed point g*, if there is one. The propagator is 


g(k 


H) 
G (kp;g(u), u) = K`’ Ğ,(p;g(ku), u)exp | | dgy(g)/B(g) 


g(u 


=i/(k°p°)[1 + Olgu?) exp} ~ | dg(C,/A,g)[1 + ow) 
~ i/(k?p*)-constant:[In(1/«) ]@/?“", (7.5.15) 


as k +0. Hence if C, is non-zero, then p?G,(p) does not have a finite non- 
zero limit as p-»0; the singularity of G,(p”) at p? = 0 does not correspond 
to a simple single-particle pole. The massless particle that gives rise to the 
singularity cannot be treated as an ordinary particle, because its long-range 
interactions are too strong. Positivity of the metric of the state vectors 
constrains C, to be positive, and we assumed a theory with A, negative, so 
k? times the right-hand side of (7.5.15) goes to zero. (The positivity 
argument is the one given in the textbooks (e.g., Bjorken & Drell (1966)) 
that the residue of the pole in a propagator is less than unity if the 
propagator is of the canonical field. Application of this argument in the 
theory with an ultra-violet cut-off shows that the divergence of the self- 
energy must be such that C, is positive.) 

Notice that if C, = 0, then the propagator does have a finite residue at 
p=0: 


R glu) 
~ix?p? Gate. ols, 4)-rexp4 — | dgn} (7.5.16) 


0 


Fig. 7.5.3. Lowest-order self-energy 
Fig. 7.5.2. Lowest-order self-energy graph that contributes to the 
graph in ¢* theory. anomalous dimension in $* theory. 
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Observe that only the first term in y is relevant to the finiteness of the limit, 
and that the limit does not exist order-by-order. A case where C, = 01s the 
pt theory, because the one-loop graph Fig. 7.5.2 is independent of p. The 
two-loop graph Fig. 7.5.3 provides the lowest-order term in the anomalous 
dimension. 


7.6 Leading logarithms, etc. 


7.6.1 Renormalization-group logarithms 


We saw in Section 7.4 how to compute the large momentum behavior 
of a Green’s function by approximating it by a Green’s function with 
m=Q. Then we used the renormalization group to reorganize the 
perturbation series into a form with small coefficients. It is of interest to 
examine how the complete result can be obtained by a systematic 
resummation of the perturbation expansion. 

For concreteness, let us examine the propagator G (p, g, u) in massless o? 
theory in six dimensions. We write its perturbation expansion as 


G.=(/P’) X}, 9°"T,(- p/u’), (7.6.1) 


where the lowest coefficient is T, =1. We will prove that each T, is a 
polynomial in In u (and hence in In(— p?/u7)) of degree at most n, with n 
being the number of loops. To do this we will regard the RG equation 
(7.3.15) not as an equation to give the variation of G, when p is changed 
with g set equal to the effective coupling g() (thus keeping the theory fixed), 
but as an equation for the -dependence of G, with g fixed. Picking out the 
order g?” term gives 


ô ð n—1 
Toe _ = 2 T, 2n’ 7.6.2 
oln H ý ‘| (9) por P ni9 of g”. 


Since yis O(g?) and B is O(g?), this equation determines T, in terms of lower- 
order T,,’s: 


T,, = constant 


ð Inu 
+ l > = [7o + soz l| din p’T,,.( — piu) 


coefficient of g?”. 


(7.6.3) 


Iteration of this procedure another n — 1 times gives Tin terms of T, and n 
constants of integration. Evidently T, is a polynomial of degree n in In y as 


n <n (0) 
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claimed: 
T,= > T,,,-,[In(- p?/p7) J" (7.6.4) 
1=0 


All but the constant term are determined in terms of lower-order 
coefficients. 
A convenient way of organizing the series is to define for each term 


L= — number of logarithms + number of powers of g? = —I+n. (7.6.5) 


This is non-negative. The sum of the terms with L=0 (viz, 
T,,o{In(— p?/u7)]") gives what is called the leading logarithm approxi- 
mation to G,. Application of uð/ôu, Bd/dg or y to G, (with one-loop values 
for B and y) increases L by 1. All the non-leading logarithms give even 
higher values of L. So the leading logarithm series exactly satisfies the one- 
loop approximation to the RG equation. Hence we may sum the leading 
logarithm series by solving this approximation to the RG equation 


| VP du’ 
G4(P7:9,4)1-o = }ex0] | “rolu » ë (PIN — PP -7>} 


u L=0 
= [1 + 4,g°ln(— p?/p*) ]61/2411/p?. (7.6.6) 
Here we used the same notation as in Section 7.5.1. This equation 
reproduces the approximation derived at (7.5.8). 
Another way of treating both the leading and non-leading logarithms is 
to use the RG equation (7.6.2) to give a recursion relation for the T 


T 2(L—n)T,, ,[In(— p?/u’) J": 


Bp aaiie TT atia 
+ [24,(n—2)— Cy] = To 2 {In(— p2/e2) 2-4 
7.6.7) 


where y = Cig? + C,g*+-:-, and B= —A,g*—A,g?+°" 
The leading logarithm part of this equation is 
—2nT, o =[2A,(n—1)—C,]T,-10- (7.6.8) 
This equation determines the leading logarithm series in terms of Ty, = 1, 
and of A, and C,; this series sums to (7.6.6). 
Equation (7.6.7) also determines the non-leading logarithms. For 
example the next-to-leading terms are 
—2(n—-1)T, 1 = [2A ,(n— 1)—C,]T,- wir [2A ,(n — 2) — C,]T,-2 0 
(7.6.9) 
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n — L = logs 


P 

V y 
V/Z 
V 


n = # loops 


Fig. 7.6.1. Illustrating the leading logarithms and non-leading logarithms of a 
Green’s function. 


Again a convergent series results. Its sum is equally accurate as the result 
of using the two-loop approximation to B and y in the solution (7.3.21). 
There pis set equal to ( — p?)!/? and we take the one-loop approximation to 
G,(p;g(./ — p°), m=0, \/ — p°), i.e., (i/p?) (1 +g°T;,). 

The series for larger L may be similarly determined. In Fig. 7.6.1 we 
illustrate the structure of the calculations. The diagonal lines are lines of 
constant L, and the recursion relation (7.6.7) determines a coefficient T, , in 
terms of lower-order terms on its diagonal and on the higher diagonals. 

Suppose we have computed perturbation theory to n— 1 loops for G, 
and wish to compute the n-loop term. In this term the coefficients of all but 
In( — p*/y?) and the constant are fixed by the lower-order calculations. 
Thus the new information is in the nth order coefficient C, for y and in the 
terms with one and no logarithms, i.e., in T, „- , and T, ,. The (In)° term in 
(7.6.7) is 


n-1 
—2T, ,-1=—-C,+ > [24,- Jf —C,- JT); (7.6.10) 
j=l 
This shows that knowing the coefficient, T, ,_,, of the singly logarithmic 


term in T, is equivalent to knowing the n-loop coefficient C, in y(g). 

Exactly the same procedure may be applied to any Green’s function Gy. 
The only difference is that there are several external momenta. It is enough 
to consider the connected graphs. Let us write 


Gy (connected) = (p)? g- SS GM In" "( — p?/y?)g?", 
n=0 L=0 
(7.6.11) 
where p is one of the external momenta and we have factored out g”~?, 
which is the power of g appearing in the tree approximation. The 
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coefficients G™} are now functions of the dimensionless ratios of the 
Lorentz invariants formed from the external momenta. 

The leading logarithm series and all the non-leading series are con- 
vergent, so they can be summed. The n! behavior of large orders only 
appears when we consider the single log and constant terms, and thus in the 
sum over L. 


7.6.2 Non-renormalization-group logarithms 


In all of the above cases there was one logarithm of the large momentum per 
loop. There are more complicated situations where not all invariants get 
large. A simple standard example is the form-factor of the electron in QED 
with a massive photon (Fig. 7.6.2). Here q? = (p, — p,)? gets large but p? 
and p? are fixed. It turns out (Sudakov (1956), Jackiw (1968)) that there are 
two logarithms per loop. These must be in the coefficients G®} in (7.6.11), 
since the power of the logarithms is too high for them to be the explicit 
logarithms in (7.6.11). 


Fig. 7.6.2. The electron’s form factor in QED. 


One would like to find the large-momentum behavior in such situations. 
A much-used technique is to sum the leading logarithms, which are often 
relatively easy to compute. For the on-shell form-factor this gives a 
convergent series which sums to (Jackiw (1968)) 
F ~exp[ —(e?/1627)In*q7 J. (7.6.12) 
(See also Mueller (1981). For the simple cases that we considered earlier, 
with all momenta large, the leading logarithm approximation 1s justified by 
renormalization-group methods, as we have seen. For cases like the present 
one of the form-factor, it may be a bad approximation (Collins & Soper 
(1981, 1982b)). However methods are available to obtain large-momentum 
behavior in some of these situations. See Mueller (1979, 1981) and Collins 
(1980) for the electron form-factor and Collins & Soper (1981) for cases in 
strong interactions. 


7.6.3 Landau ghost 


The leading logarithmic approximation (7.6.6) for the propagator has a 
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singularity when 


p’ = —p’ exp[ — 1/(A,g’)]. (7.6.13) 


This singularity, if present in the true propagator, would signal a state of 
this value of mass squared. Since the residue has the opposite sign to that for 
a normal propagator pole, this would be a state with unphysical properties. 
It is called the Landau ghost (Landau & Pomeranchuk (1955)). In a non- 
asymptotically free theory like QED, it occurs at very large energies and in 
an asymptotically free theory like QCD, it occurs at low energies. In either 
case it occurs where perturbation theory is inapplicable and so where the 
leading logarithmic approximation is a bad approximation. 


7.7 Other theories 


We restricted our attention in deriving the renormalization group equation 
to a theory with one field, one coupling, and one mass parameter. However 
we may treat, by exactly the same method, a theory with several fields 
f~1,--.,@, (each may be Bose or Fermi), several couplings g,,...,g,, and 
several masses. A change in the unit of mass u is compensated by a change in 
each of the parameters and in the scale of the fields. The main problem is a 
proliferation of indices. It is easiest to treat couplings and masses on the 
same footing. So we have a collection g,,...,g- of renormalized parameters, 
with C being the total number of couplings and masses. Then we must write 


d 
uI ju) = P (8), (7.7.1) 
u 


each function £$ , being, a priori, a function of all the parameters. For the case 
of a theory with a single coupling, and a single mass, would have (g1, g2) = 


(g,m*), and $, = B(g) and B, = — y,,(g)m’. 
Given a function f of the renormalized parameters and of u, we have 


d ô Ô 
Hig & 4) = (e F e) (7.7.2) 


The RG coefficients can be determined by noting that the bare couplings 
J o) are invariant: 

d 
HTI p,d)=0. (7.7.3) 
These form C equations for C unknowns. 


The RG equations for Green’s functions are complicated by the 
possibility that fields of the same quantum numbers may mix under 
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renormalization. Writing 
Poi = » Cg, d) ;, 
J 
we find, for example, that ‘kinetic energy’ terms in ¥ are of the form 
7), Ob (oi = 2) 06 0o1), Ci bu = 22 06 09, Z j- 

l J» l ’ 

Hence, we have a matrix counterterm for the field-strength renormalization 


Z; (2, d) = X Cust; zE (€°C);;, (7.7.4) 


where ' denotes transpose. (Note that ¢ has a different meaning here than in 
Sections 7.1 and 7.2.) 
If we define a matrix anomalous dimension by 


d 1 
—¢.=—1Y y..d. 7.7.5 
Ma 2 Ibs ( ) 
then invariance of the bare fields gives 
d d 
0= Ma PO = (unt E ier (7.7.6a) 
1.€., 
d 
iroa 
aby Mane? 
or 
d 1 
u 


If Z is diagonal (as is the case in most theories we consider): Z; ; = ô;;Z;, then 
(7.7.6) reduces to an anomalous dimension for each field 


d 
Me = —37:9;, (7.7.7) 
u 


A Z 
Mau n i— Vir 


In the case of a diagonal Z, the renormalization group equation for an N- 
point Green’s function is 


d a 2 ô | 
—Gy =] u— + (g)=— |G 
Pdu N È 2 PD | N 


=— ¥ 27, (8)Gy. (7.7.8) 
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Here y; 1s the anomalous dimension function for the «th external field of 
Gy. If the renormalization matrix Z is not diagonal, then we have a similar, 
but more complicated, set of equations for the Green’s functions. 

The equations for the evolution of the couplings are coupled, so their 
solution is in general complicated. Considerable simplification can be 
achieved by using our knowledge of the dependence of the counterterms on 
massive couplings. Let the couplings g,,...,g, be dimensionless and let the 
corresponding bare couplings be 


(4—d)p; 


H Joji (g.d), 
where the gio; depend only on the renormalized dimensionless couplings 
and on the UV cut-off. For the sake of definiteness, we assume that the 
physical dimension of space-time in the theory is d = 4. The wave-function 
renormalizations Z, also only depend on the dimensionless couplings and 
on d. Let the other parameters (masses and super-renormalizable coup- 
lings) be denoted by f,, and let the dimension of f, be (4 — d)t, + o,. If f, is 
the mass of a fermion, then c, = 1, t, = 0. If it is a boson mass squared, then 
6,=2 and t,=0, while for a super-renormalizable coupling t, #0 and 
o, > 0. Then (by Section 5.8) the bare quantity corresponding to f, has the 
form 

Jos = >, pe O° X F yg, d). (7.7.9) 

dy =Gs 
Here X is any product of the dimensional couplings with dimension (at 
d = 4) equal to the dimension ø, of f,. 

Requiring invariance of the bare couplings gives 


A 
0 
0=(4-— A) P:I.oyi al > Pizy Jow (7.7.10a) 
j=1 ` 0j 


A 0 
= > Biz, In Z (7.7.10b) 
J= j 


C 
X X,j Jj 


pS BT TA (7.7.10¢) 
t,X t 
There is then a triangular structure to the evolution equation: the evolution 
of a coupling depends only on couplings of the same and lower dimensions. 
In these equations, the index j runs over the values 1 to A, i.e., over those 
values that correspond to the dimensionless couplings, while the indices s 
and t run over the labels for the dimensionful couplings. 
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If we use minimal subtraction, the calculation of the coefficients 1s rather 
easy. Let G,(g), Z(g), and F sx(9) be the coefficients of single poles in giob Z; 
and F x. Then we have 


p;(g,d) = (d — 4)p g; + B,(g), 


B,(g,f,d) =(d — 4)t,f, + B,(g.f), (7.7.11) 
with »,, B; and B, satisfying 
A 
B; = p:G; pa Pif ig 1? 
j=1 j 
A 
0 
Vi 3 Pitiag, i’ 


7.8 Other renormalization prescriptions 


It was only for the sake of simplicity that we restricted our attention to the 
minimal subtraction procedure. The proof in Section 7.2 in fact shows that 
any change in renormalization prescription can be compensated by a 
change in renormalized parameters and a change in the scale of the 

renormalized field. Let us examine what happens in more general schemes. 
It is sufficient to restrict our attention to a theory with a single coupling and 
mass, like ø? theory in six dimensions. 

If we choose a renormalization scheme with an extra mass u, which might 
be a renormalization point, then renormalization-group coefficients can 
still be defined and computed by (7.3.4), (7.3.5), and (7.3.14). What we lose in 
general are: 


(1) the simple formulae (7.3.12), 
(2) the lack of dependence of p, y,, and y on the masses. 


In order to discuss UV limits, it is sensible to choose a scheme in which the 
limit m—0 exists. This means that $, y,,, y are finite, order-by-order, as 
m— (0. 

Now, different renormalization schemes are related by finite renormali- 
zations of the parameters. So we may relate the RG coefficients in different 
schemes by looking at the theory in the physical space-time dimension and 
- then computing pd/dy in one scheme in terms of ud/du in another scheme 
with the aid of the chain rule. 

Suppose we have a second scheme in which the new mass, coupling and 
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field are m’, g' and @¢’: 
g' = g'(g,m*/n"), 
m? = m?z,,(g, m? /p?), (7.8.1) 
b’ = oC(g,m*/p”). 
Then the Green’s functions in the new scheme are 
Gy(p3g',m’, u) = CX Gy(p3g,m”, p). (7.8.2) 
The renormalization-group coefficients in the new scheme are: 
d 0 ô 
p'(g',m?/ p?) = ug =(u ead = }9'(g.m’/u"), (7.8.3) 
fdu "Ou om 


A 12 = OK dg | 
alg’ m? uE — 


A C À 
= Ym — ( = + Bx — ym | In [z,,(g,m7/u7)]. (7.8.4) 


Our definition of the total derivative d/dy is as the derivative with respect to 
u when the bare coupling g, and bare mass mą are held fixed. Therefore, it is 
the same in both schemes. Notice that there are two steps in computing f’ or 
Ym: First, compute the right-hand side expressed in terms of g, m, and u; 
second, change variables to the new coupling and mass. 

The anomalous dimension of @’ is obtained as 


nus ping; 
y'(g',m°/y Nagin Gs 


d ¥ 
= eH ea 


asd a ee ERT m?/u?). (7.8.5 


A considerable simplification occurs in relating mass-independent 
schemes. Then g’, ¢ and z„ are functions of g alone, so that the 
renormalization-group coefficients in the new scheme satisfy 


EE, 
BIg) = Pa g), 


cm Zm(Q)s (7.8.6) 


Ym(G) = im — BE 


O 
rg)=y- 2B a7 Int (9). 
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In this case, let g’, z,, and ¢ have perturbation expansions 
g=gtag ta gt, 
Zm = 1+b,9g7 + b.9* +°°:, (7.8.7) 
€=1+c¢,97+¢.9*+---, 
and let the expansions of f, y,,, and y be 
B(g) = — A,g° — A29? — A3g'-"*, 
Vag) = Bg? + Bgt +>, (7.8.8) 
x(g) = Cig? + Cag + °° 
The expansions of f’, yn, and y’ are written similarly with all quantities 


primed. Then we can express them in terms of g by using (7.8.7). For 
example: 


B'(9(9)) 


—Ajg(l+a,g? +a,gt++::-—Ajgi(l+a,97+-:))-Agg’? +:°: 
— Ag? — g*(4, + 3a,44) — 9(A5 + 5A5a, + 3A\a, + 34,02) +. 
(7.8.9) 


This must agree with the perturbation expansion of the right-hand side of 
(7.8.6) 


Bdg'/0g = — Ag? — g°(A, + 3a, 4) — g (43 + 34,0, + 5A a) 4°". 
(7.8.10) 


From these equations we see that the first two coefficients in B do not 
change when the renormalization prescription is changed, 1.e., 
A, = A}, A, =A). By generalizing the above equations to all orders we also 
see that, by adjusting the terms in the expansion of g’(g), we may choose the 
terms beyond the second in f’ to be whatever we want. In similar fashion we 
see that only the O(g?) terms in y„ and y are invariant. 

Note that if the one-loop term in y or y,, is zero then the whole of y (or Ym 
respectively) may be made zero by a choice of renormalization prescription. 
This privilege does not extend to $: if the first non-vanishing term in f is at 
n-loop order (n > 1) then that term is RG invariant (but not the (n + 1)-loop 
term). 

In a theory with more than one dimensionless coupling we may try to 
apply the same methods. This is left as an exercise. It will be found that only 
the first term in each $ is now invariant, except in the case that the one-loop 
B-function does not mix the different couplings. 

The invariance of these coefficients only applies within mass- 
independent renormalization prescriptions. If one were to use, say, on-shell 
subtractions, then the parameter u would not appear, so all derivatives with 
respect to u are zero. Then we have B=y=y,=0. (The asymptotic 
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behavior that we extract by varying u can no longer be computed by 
renormalization-group methods, if we stay within this renormalization 
prescription. In this case the Callan—Symanzik equation must be used 
instead — see Callan (1970) and Symanzik (1970b).) 


7.9 Dimensional transmutation 


Consider a renormalizable field theory with one dimensionless coupling g 
and no masses. A physically important case is QCD with several flavors of 
massless quark ; with two or three flavors this is an approximation to actual 
strong interactions. 

Since the basic theory has no masses we must use a renormalization 
prescription with an arbitrary renormalization mass u. Although the theory 
apparently has two parameters, g and u, we saw that this is not so: a change 
in u can be compensated by a change in g. In fact, as Coleman & Weinberg 
(1973) pointed out, the theory really has no parameters at all. The point is 
simple but somewhat elusive, so we explain it at length. 

A physically measurable quantity must be renormalization-group 
invariant. For example, let M(g,) be a particle mass. By dimensional 
analysis, it is u times a function of g alone. So 


d Ô 
0=p—M=M+B—M. (7.9.1) 
du ôg 
Hence 
M- tant l i zl 
= w'constant:exp| — | —— 
B(g’) 
1 A é ee aA 
= wCexp 2 — 2 A dg ee 
rd ea O 
1 A 
= wCexp ¢ — —— —-2] O(g?) s. 7.9.2 
m pd 24g A n (g) + o>} (7.9.2) 
Here C is a constant and we have written B(g) = — A,g° — A.g° + °°, aS 


usual. 

Note that the Green’s functions are not renormalization-group in- 
variant: to measure a Green’s function, one must define the field operators. 
This definition has an arbitrariness, which is the freedom to vary its scale. 

The formula (7.9.2) has a number of consequences: 


(1) Non-zero particle masses cannot be computed in ordinary perturbation 
theory (in a theory with no mass in the Lagrangian). For to avoid large 
logarithms one must set u to be of order M, where M is the particle mass 
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being computed. Then (7.9.2) tells us that g(M) is not a free parameter; it 
is a number of order unity. 

(2) In a non-asymptotically free theory (A, < 0), suppose we have a small 
value for g(u). Then u < M, where M is the value of the mass of any given 
massive particle. Perturbation theory is therefore only useful for 
Green’s functions when the external momenta are much below the 
threshold for producing any of the massive particles. 

(3) In an asymptotically free theory (A, > 0), we have u > M whenever g(t) 
is small. Perturbation theory is useful in such a theory only when 
momenta are much bigger than particle masses. 

(4) Since the g-dependence of (7.9.2) is universal, i.e., the same for all 
particles, ratios of particle masses are pure numbers independent of g 
and u. 


Let us emphasize once more that these results are true when there are no 
explicit mass terms in the Lagrangian. 

The observation of Coleman & Weinberg (1973) comes from asking what 
can be measured in the theory. Suppose we start with u = u, and g = g, and 
ask how the theory changes when we work with the theory with a different 
value of g, g = g,.(We suppose g, and g, are between g = 0 and g = g*, the 
first fixed point of f.) Each version of the theory has an effective coupling 
satisfying 

Jvers1(H1)=G1> Ivers 2(M1) = 92- 


Now evolve g(u) in the second version to the value of u where 


Ivers (M2) = 91- 
Then the second version of the theory is just the first version with all 
momenta scaled by a factor u,/u,. For example let o be a cross-section 
depending on momenta p,,..., py. Then RG invariance and dimensional 
analysis give us 


O(Py,--->Pn3 92 Hy) = O(P1,--->PniG1> H2) 


dima 
H2 Hı Hı 
p= OLD ial Prin 7.9.3 
(=) (2r, las ) Oe) 


The last factor is the cross-section in version 1 of the theory, with its 
momenta scaled. 

We see that changing the dimensionless coupling in a massless theory 
does not basically change the theory, but only its mass scale. This is called 
dimensional transmutation. 

There are many ways of specifying the scale of the theory: in QCD one 
might give the proton mass. For perturbative purposes it is better to use 


7.9 Dimensional transmutation 205 


something that can be directly used in perturbation theory, for example the 
value of u at which g(u) has some given value (e.g., 0.1) in one’s chosen 
renormalization prescription. One standard way is to notice that for large u, 
g(u) has its asymptotic behavior given by 


ise a oe 
ow A, In (7/5) A} In?(u?/u2) In2 (u/n) 
In? (In a 
Ol ain) | 7.9.4 
| In® (u/Ho) ( ) 


Here py is a reference value of u. If pọ is changed then the series is 
reorganized; only the first two terms are unchanged. As is conventional 
(Buras, Floratos, Ross & Sachrajda (1977)), we define the scale A of strong 
interactions as the value of uo for which the 1/ln?(u?/u2) term is zero. This 
gives (7.5.5). 

If we change from, say, minimal subtraction to momentum-space 
subtraction, then the theory is unchanged provided the coupling is 
adjusted. This may be done in perturbation theory. For example, we might 
find that g in the MS scheme and in the momentum-space subtraction 
scheme are related by 


Ims = Imom + 41Jmom `- (7.9.5) 
Now let gy, be given by (7.5.5) with A = Ayg, and let g mom be given by (7.5.5) 
with A = A,om- (We already know that A, and A, are the same in both 
schemes.) Substituting these expansions into (7.5.5) and requiring con- 
sistency gives 


Aus = IN ions exp (a,/A 1). (7.9.6) 


Notice that both A, and a, are obtained from one-loop calculations and 
that there are no higher-order corrections whatever (Celmaster & 
Gonsalves (1979)). 

An amusing consequence is obtained by substituting (7.5.5) for g in 
(7.9.2). Since M is independent of u we may let p— œ. The higher-order 
terms all go away and leave 


M = CA(A,)42/247, (7.9.7) 
1 


This equation is not very useful for performing perturbative calculations. 

If the theory is a complete theory of physics, then measurements of o and 
the p’s in (7.9.3) will be in terms of a standard of mass. This we may take to 
be the mass M of some particle (say, the proton). Let us now change the 
theory by changing the coupling from g, to g,, just as we did earlier. Then 
the standard of mass is multiplied by yu,/u,. So if we do experiments in 
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version 2 with numerical values of momenta equal to those in version 1, the 
momenta are actually increased by a factor u,/u,. Therefore, (7.9.3) tells us 
that ø gets multiplied by a factor (u,/u,)%™*. But its unit of measurement 
increases by the same factor, so the numerical value is unchanged. In this 
sense massless theories with different values of the coupling (or different 
values of A) are indistinguishable. This is perhaps the most important result 
of Coleman & Weinberg (1973). 

However, there are many experiments that claim to measure A. There are 
even some that give (without qualification) a single measured value of g. 
How can this be? The second problem is easy to dispose of. What is being 
measured is the effective coupling g in some renormalization scheme with u 
set to a value of the order of the energy of the experiment (typically in e* e~ - 
annihilation). Strictly one should specify not only the value of g but also the 
scheme and the value of u. Now the experiments are at around 10 to 
30 GeV, and A is at most a few 100 MeV. The variation of g over this range 
and the variations between the usual renormalization schemes are often no 
more than the size of experimental errors. So it is possible to talk loosely. 

However, we just asserted that massless QCD with different values of A is 
the same theory. The sense of a measurement of A is that we measure the 
numerical value of the ratio of A (defined by (7.5.5)) to a standard of mass. 
For the purposes of the argument, we may regard the standard as being that 
the nucleon mass is 939 MeV. In terms of dimensionless quantities the 
measurement is of the constant C in (7.9.7) when M is the nucleon mass. (In 
the MS scheme, we find that C is between about 5 and 20.) The non-zero 
masses of the quarks make a relatively small perturbation of the above 
argument. 

Notice that if we play God and double the size of A, then the size of the 
standard mass also doubles, so that numerical results of experiments are 
unchanged. 

In QED the situation is different. The electron has a mass, and its 
Coulomb field is classical at large distances. A mass-shell renormalization 
scheme is natural. Since there is a very important mass-scale, an unqualified 
statement of a measurement of the QED coupling, viz., e = (4n/137)!””, 
makes good sense. QED with a different value of e is a different theory, 
unlike QCD in the absence of quark masses. 


7.10 Choice of cut-off procedure 


It is very convenient to use dimensional continuation as an ultra-violet cut- 
off in perturbation theory. However, there is no known construction of a 
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complete theory in an arbitrary complex dimension, so one must beware of 
assigning too much physical significance to use of dimensional con- 
tinuation. This is especially true when we use minimal subtraction, which is 
a procedure that exploits the form of the cut-off dependence of the theory. 
However, the renormalized theory with the cut-off removed does not 
depend on the form of the cut-off. We saw this in our one-loop calculations. 
In general the fact is easiest to see by using BPHZ renormalization, in which 
an integrand is constructed that gives a manifestly convergent integral. The 
only freedom left is a change of renormalization prescription, otherwise 
known as a change of parametrization. 

In this section we will examine the renormalization-group properties 
when a different U V cut-off is used. For definiteness we cut off the theory by 
using a lattice, with spacing a. We consider any theory with a single 
dimensionless coupling g and a single mass m. It is, of course, possible to 
generalize to any cut-off procedure and to any theory. In general we will 
need a renormalization mass yp, in order that we can take the massless limit. 
The bare coupling go, bare mass mo, and the field-strength renormalization 
Z are written as functions of the finite parameters g, m and u, and of the cut- 
off a. Then the renormalized Green’s functions are written in terms of the 
bare Green’s functions 


Gy(X1,---,%y3g,m, p,a) = Z7 ™?(g,m, uw, aJGO'(x,,...5XN5Gos Mo, a), (7.10.1) 


and for them the limit a —> 0 exists. 

The renormalization-group structure is essentially unchanged. Let us 
again choose a mass-independent renormalization prescription, so that go, 
Z, and mo have the forms: 


Jo = Jo(9, Ha), 
Z = Z(g, pa), (7.10.2) 


mg = m?Z,,(g, ua) + a~ 7 Y(g, pa). 


The massless theory has m = 0, and, as before, g, and Z are independent of 
mass. But now the cut-off parameter is dimensional, so gy and Z have 
explicit dependence on u as shown. But the dimension of g, is fixed at zero, 
so the d-dependent power of u is not used. 

The m?-dependence of the bare mass squared is again linear. But it is no 
longer true that mọ =0 when m=0. In the case of dimensional reg- 
ularization the only remaining dimensional parameter is „u, and it is not 
possible (Collins (1974)) to generate by minimal subtraction a counterterm 
u? Y(g, d). But with a lattice cut-off a term a~ * Y is both possible and needed, 
as we will now verify by computing a low-order graph. 
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7.10.1 Example: pt self-energy 


The simplest example that shows the existence of the Y-term in (7.10.2) is the 
self-energy graph of Fig. 7.10.1, not in the ¢° theory that we have been 
using, but in the ¢* theory in four space-time dimensions (with m = 0). The 
Lagrangian is (2.3.1). With dimensional regularization the value of the 
graph is 


19 (2m) ~4 | d4k/k? =0, 


OU 


Fig. 7.10.1. Lowest-order self-energy graph in t theory. 


but with a lattice cut-off we find 


— igy(32n*)~' | d?k dw D(w, k; a). (7.10.3) 


|kH]| < n/a 
Here the Euclidean lattice propagator is 1/(w? + k?) if w and |k| are much 
smaller than 1/a. For general values of k“, it is 


a fa 5 sin? wan, 


=i 
which is positive definite, so that the integral (7.10.3) is non-zero and 
diverges to a number of order 1/a? as a0. 
A similar divergence occurs in the self-energy of a scalar field in any 
theory. 


7.10.2 RG coefficients 


We now continue our general discussion of the renormalization group 
when a lattice cut-off is used. As in the treatment using dimensional 
regularization we define a renormalization-group operator 


d ô ô | ,@ 
ey aa ay ar aT eT ee 7.10.4 
H du H au Bg pa) ag Vm(G> Ha) am? ( ) 


We have changed our notation slightly, and used an overbar to indicate 
renormalization-group coefficients in the cut-off theory. These coef- 
ficients J and y,, have finite limits, B(g) and y,,(g), as a— 0. In our later work 
it will be rather important to distinguish the coefficients before and after the 
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cut-off is removed. The coefficients can be computed from 


d d 
— =0= — m2 
Hau”? Man 


with (7.10.4) used for ud/du. We also have the anomalous dimension 


y= ud ln Z/dy, just as with dimensional regularization. This all results in 


0 
— (HA) Fa) 909 pa) 
ja ace 
gg 904 ua) 


i ô ð 
i= Hua) + B A In Z,,(9, Ha), (7.10.5) 


7 0 . 0 
y = azt + P= |i Z(g, pa). 


In addition there is the constraint 


d'.e siaa 0 0 
a =| mazes + BS [Yo qa (7.10.6) 


The information on the divergences is all contained in the finite functions 
B, Ym and y. If desired, we can use minimal subtraction with the form 


Jo =9 + 9°G,, In (ay) + g°[G,,In* (au) + G,,In(ay)]+---, 


so that B = —g°G,, —g°G,, —::: is a function of g alone. In order that 
B be finite as a—0 all the logarithms of ay must cancel in f. This implies 
a set of relations for the counterterms, the first of which is 2G,, = 
3G?,. An analogous set of relations occurs when we use dimensional 
regularization, as can be seen from (7.3.12). These we will discuss further in 
Section 7.11. Note that for minimal subtraction with the lattice cut-off we 
have B= B, Ym = Jw ete. 


7.10.3 Computation of g, and Z; asymptotically free case 


If we were to compute the exact theory, rather than a perturbative 
approximation, we would need to know how g,(g, ua) depends on a as a > 0 
with g and y fixed. A low-order calculation is not sufficient, for g, has large 
logarithms in its perturbative expansion. Provided the theory is asymptoti- 
cally free. we can remedy this by using the renormalization group to 
improve the calculation, just as we did for the large-momentum behavior of 
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Green’s functions. The starting point is the equation 
d C = O 
—go = U—In + BJ = 9, 7.10.7 


which is in effect the renormalization-group equation for g,. We may solve 
it just as for the Green’s functions. 

Ultimately, we will let a approach zero while holding g and u fixed. But 
first let us keep a non-zero. Then we can define an effective coupling g(u) by 


p'dg(u’)/duy’ = Bigle’), (7.10.84) 
with the boundary condition 
glu) =g. (7.10.8b) 


We will also need the effective coupling at a =0. For the moment, let us 
denote it by the symbol g(w’). It satisfies 


b'dg(u')/du’ = BG(u’)) = Bu’); u'a = 0), 
glu) = g. 
Implicitly there is dependence of g on u and g, and of g on u, g and a: 
9=Gu's9), J= J(u; u,a, g). 
Of course, g(u')—> g(u’) as a 0. 
We can solve the renormalization-group equation (7.10.7) for g, to find 
Jo = Go(9, au) = 9o(g(1/a), 1). 


Now, when a is small, it might appear that we can replace g(1/a) by g(1/a), 
and that g, is well approximated by the first term in its perturbation 
expansion (since g(1/a) is small). That is, 


Jo = G(1/a) + negligible error. 


These suppositions are actually false, for two reasons. First, B(g, ap) 
in general depends on ap, so we cannot just replace g at u = 1/a by g(1/a) 
computed in the a = 0 theory. Secondly, we cannot simply drop the higher- 
order terms in gp, since the dependence of renormalized Green’s functions 
on gy is singular. Thus small errors in g, may give rise to large errors in a 
Green’s function computed as a function of bare quantities. 

To derive the correct formula we must examine the a-dependence of 8 
more closely. So we write the perturbation expansion of g, in the form: 


Jo =g + g°[G,,In (au) + Gio + G, (ap) ] 


+ g°[G,,|In* (au) + G3, 1n (au) + Gj 5+ G,(aw)]+°°°. (7.10.9) 
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Here we have not specified the renormalization prescription, so in addition 
to the logarithms we need finite functions G, (au), etc. We have explicit 
constant terms G, so we define Gay) to be zero at au=0. Once 
divergences and subdivergences have been subtracted from Feynman 
graphs, the remainders converge with power-law convergence in momen- 
tum. This is a consequence of our treatment of Weinberg’s theorem, and is 
further treated in Weinberg (1960). Therefore we can say 


Gay) = O((az)") 
as au—0, for some positive number c,. In general, G; equals ay times 


logarithms of au, so we can safely set c; = 1/2. 
First we compute the f-function: 


0 > 0 J 
3 eae 5 S 
a Gee [+a |26zaIntam + Ga + 5a Oo |+ l 


{1 T 3g°[G, , In(au) + Gio t G,] eS 


G = 
=-g9 1G ee 


0 
_ 2 
—g foz —3G6,9G,, tau — 3G; — 3G, , G, In (ap) 


~ 36,061} Fe (7.10.10) 


The relation 2G,, = 3G?, must hold in order that f is finite as a0. The 
limit a— 0 gives 


B(g) = B(g,0) = — g°Gy, — 9°(Gy, —3G,9Gy,)+ °°, (7.10.11) 
so that with our usual notation 4, = G}; and A, =G,, — 3G,, Gio. Since 
G (ap) and G,(ap) go to zero like a power of ay (times logarithms) when ap 
— 0, their logarithmic derivatives auôG ;/0(ap) also go to zero like a power. 

The first step in our calculation of g, is to observe that the RG invariance 
of g, implies that 
Go(9s HA) = Jo(G(u), u'a) 
= g,(g(1/a), 1). (7.10.12) 
The next step is to examine the size of the error that is made in replacing 
g(1/a) by the effective coupling g(1/a) in the a=0 theory. Finally, we will 
find the accuracy to which g,(g(1/a), 1) must be computed in order to obtain 
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the correct renormalized Green’s functions at a= Q. 

The difference between the two effective couplings g(1/a) and g(1/a) will 
turn out to be of order g(1/a)? when ais small. So let us define the fractional 
error ĝ°A by 

glu’; u,a, g)= glu’ sug) [1 + ĝ(u'; u, g) Alu; u,a, g)]. (7.10.13) 
We will now show that A is finite when yp’ = 1/a and a0. 
From (7.10.13) and the definitions of 8 and $ we find that 


ô Pa AnA E EE 
we Me) =g [EA + g°A)@, w'a) — (1 + 3g°A)B@)] 


=e eae + O(ĝ? (u'a)! ?) + O(G*A). (7.10.14) 
O(u'a) 


Now g?(u')~1/A,In(y'/A) as u'—>œ, so this equation tells us that 
A(u’, u,a, g) is finite when a0 and u < yp’ < 1/a. In fact it implies that 
A(u’) = — G,(ap') + O(1/In(1/a)). (7.10.15) 


We now compute g,. It is convenient to write a formula for its square: 


9o(g, pa)” = 9o(g(1/a), 1) 
= {g(1/a)(1 + g°A) + g°(1 + APEG o + G,(1)] + OG°)}? 
E 1 Aln [In (1/a°A?°)] 
[A;In(1/a°A’)]  [A} In? (1/a°A?)] 
4: o 2o 
[4} In? (1/a?A?)] 
Here we used the formula for g(u) in terms of u and A — (7.5.5). The formula 
for g, in terms of 1/a and A is the same as (7.5.5) except for an additional 
1/lIn? term. Observe that it was essential to keep the a-dependence in 
B(g, az); the — G,(1) term in (7.10.15) canceled the G,(1) in the two-loop 
coefficient in gp. 
Finally we express (7.10.16) in terms of g, a and u: 


1 A, In(In(1/a?p7)) 


+ OfIn?[In(1/a)]/In3(1/a)}. (7.10.16) 


4019.0) = Aina?) ASIN aL?) 
1 1 A 
F TUEA 2 Z 4, nts") = 24, fa) 
+ OfIn2[In(1/ap) ]/In3(1/ap)}, (7.10.17) 
where 


f(g) = i dg'[1/b(g) + 1A,g"°) — 4,/(Aig’)]. (7.10.18) 
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Similar formulae hold for Z, for Z,,, and for Y. Thus 


Z=[A,g?In(t/a*p?) JOOP x 


> fag) C In(In(ay)) i 
x exp} | da F- + aie + of =C i 


(7.10.19) 


where y = C,g? + O(g*). 


7.10.4 Accuracy needed for go 


Let us now suppose we compute the renormalized Green’s functions: 
Gy(x1,.--, Xy 39, M, uza) = Z~™!? G ow(X1>- <- XN;JosMo;a). (7.10.20) 


We must now let a approach zero, and ask how accurately we need to 
compute g, and Z. In (7.10.17) and (7.10.19) we gave formulae for g, and Z, 
with explicit estimates for the errors coming from uncalculated corrections. 
These equations tell us the value of gọ(g, au) when we let a > 0 while keeping 
g and u fixed. Since the bare Green’s functions have singular dependence on 
J, the uncalculated corrections might affect the values of the renormalized 
Green’s functions. In fact these terms do not affect the renormalized Green’s 
functions in the continuum limit, as we will now show. 

The key observation is that the renormalized Green’s functions are finite 
functions of the renormalized parameters. Thus we do not need to hold the 
renormalized coupling and mass fixed while taking the continuum limit 
a—>0. We may in fact let them vary continuously, provided only that their 
values at a = 0 are the same as before. Now examine (7.10.17). It is evident 
that we may absorb the whole of the correction term into just such a 
variation of g. In fact the necessary change in g is of order 
In? (In(1/a))/In(1/a) as a0. So we may choose the bare coupling to be 


E 1 Aln [In (1/a?u?)] 
A, In(1/a2u?) A? In?(1/a? u?) 


1 1 A 
SG a ind gd | 
+ aT Gio g? A, n( 19 ) so | 

(7.10.21a) 


Hence in (7.10.16), we can also drop the O {In?[In(1/a’)]/In°(1/a)} terms. 
So we have the following formula for gọ in terms of a A alone: 
1 A, \In[In(1/a7A7) ] 2G 10 


gm a tag a: (710.210 
95A maA) Asln*(1a®A®) AFA UO”? 
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In the case of the wave-function renormalization Z, the uncalculated 
corrections can be absorbed into a factor {^ multiplying the Green’s 
function Gy. This factor must approach unity in the continuum limit. Hence 
we may use 


a j = —Cy/2A, g : x(g’) C; 
z=| 4 in ayp J H ~ pa TAg i | 
(7.10.22) 


where M is an arbitrary mass that is irrelevant when a —> 0. Notice that for 
the coupling we had a form (7.10.21 b) that had dependence on A, but not on 
u or on g. This was because g, is renormalization-group invariant: we may 
take u arbitrarily large without affecting g,, provided that we also set g 
equal to the effective coupling at u. When u is very big, g is very small, and 
the higher-order corrections contained in f(g) go to zero. But Z is not 
invariant ; it must depend on g. What we can say is that any dependence ona 
of the form 


Z = finite- [In (1/a)] 7 124 


will produce finite Green’s functions. 
Notice that if the one-loop divergence in Z vanishes, then we may let Z be 
finite: 


Z= exp] | dg’ 19/600) | 
(0) 


There will in general be divergences in the self-energy graphs in higher 
orders. What we have proved is that they must sum to something finite. 

In the case of g,, any a-dependence of the form 

g% =1/A, In(1/a?) — A, In [In (1/a) ]/[A}1n?(1/a’) | + finite/In7a 
will give finite renormalized Green’s functions. Only knowledge of A, and 
A, is necessary for this. They are obtained from one- and two-loop 
calculations. The coefficient of the 1/In*a determines the value of g. 

The formula (7.10.21b) shows the fundamental significance of the A- 
parameter. In a renormalizable field theory, there are divergences, so one 
cannot simply specify a single number as the bare coupling constant. 
Rather, one must construct the theory as the continuum limit of some 
lattice theory, with g, depending on the lattice spacing, a. Equation 
(7.10.21b) gives g, as a definite numerical function of a. 

Unfortunately, there is a certain arbitrariness in precisely how one 
constructs a lattice approximation to a continuum theory. This arbitrari- 
ness is physically irrelevant (although some particular approximation may 
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be superior when it is used for a numerical calculation). So (7.10.21) is also 
important because it expresses the bare coupling in terms of quantities (A, 
A,, and A,), which have direct meaning in the continuum theory, and in 
terms of one number G,,, which depends on the lattice approximation, but 
which can actually be computed analytically (Hasenfratz & Hasenfratz 
(1980)). The result of such a lattice calculation is normally given as the ratio 
of a Ajattice tO the value of A in some standard continuum renormalization 
scheme. The definition of Aattice is the value for which 


96 = 1/[A, In(1/a* A firtice) ] 


— Az In[In(1/a?A Juice) ]/[4} In2(1/a7A Lice) ] (7.10.23) 
gives the same continuum limit as (7.10.21). It 1s easily checked that this is 
Miattice = A EXP (G4 9/A,). (7.10.24) 


Despite the fundamental significance of A, there is a convention 
dependence in its definition. In specifying a theory by its value of A, one 
must specify these conventions. This is analogous in its effect to the need for 
specifying a system of units in electromagnetism. The main convention is 
that of the renormalization prescription. The other convention is the one 
implicit in the choice of the constant in (7.5.3). It is sensible to follow the 
usual convention, to avoid confusion. 

We have seen that higher-order corrections (beyond two loops) do not 
enter into our formula for g, in terms of A. This is in contrast to (7.10.21a), 
which expresses g, in terms of g and pu. So it is sensible to treat A as a 
fundamental parameter of the theory — say in strong interactions. But 
practical considerations intervene if one tries to measure A. A typical 
measurement consists of measuring a quantity for which a useful per- 
turbation expansion exists (for example, a jet cross-section in e* — e~ 
annihilation). The ex periment therefore measures the effective coupling g(u) 
at some value of u which is of the order of the energy of the experiment. 
There are errors in this measurement caused by uncalculated higher-order 
terms in the theoretical calculation of the cross-section, not to mention non- 
perturbative corrections. We can then deduce A from (7.5.6), with further 
errors due to corrections in $. 

Since g is more directly related to the size of the cross-section, it is 
perhaps correct to argue that experiments should quote their results as a 
value of g. But to give the value of A is equally valid. However, a small 
fractional error in g corresponds to a much larger fractional error in A. This 
can be seen from (7.5.6). If we change g and A while holding yp fixed, then 


|dA/A| = |dg/g|[24 19°] [1 + 0(g°)]. 
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If one could do a real calculation of the mass of, say, the proton in QCD, 
then it is the error in the value of A that would determine the error in the 
mass. As we saw in Section 7.9, when we discussed dimensional transmu- 
tation, the mass 1s proportional to A. 


7.10.5 m? 


Unfortunately mē has a 1/a? term, but the variation of mọ with m? depends 
on a [In(1/a)]~°'*’4! term. So we need the coefficient of 1/a? to very high 
accuracy. Any slight error (say of order 1/a) will be equivalent to making the 
renormalized mass diverge like 1/a as a0. The resulting need to be very 
accurate in m, leads many people to consider scalar field theories 
unnatural. 

In the case of fermion theories there is a symmetry under y > y,w when 
m = Q, so the Y term is absent and we have 


Mo = mZ,, =~ m-constant [In(1/a) ]~ 31/741. 


7.10.6 Non-asymptotically free case 


The values of g), etc., as a > 0 are not perturbatively computable unless the 
theory is asymptotically free. However, if we suppose that B in a non- 
asymptotically free theory has a fixed point, then we may write 


9o(9, au) = 9o(G(1/a), 1) 
> g,(g*, 1) asa—0. (7.10.25) 
Note that g,(g(1/a), 1) is a finite function of g, so the limit exists. However 
the same value is obtained for g, at a = 0 for any value of g(). So the way in 


which the limit is approached determines the value of g. 
An example is easily constructed. Suppose we have a theory in which 


B(g) = sin?(g”)/(2g), (7.10.26) 
and 


9o(9, 1) = 9. (7.10.27) 
Then the effective coupling has the form 
g = [arctan (1/ln (A/p)) |]. 


There is a fixed point g* = n'/*. We therefore find that the bare coupling as 
a-—(0 must be 


9o(g, au) = [arctan (1/In (aA))]*/ 


1 
= n"? ————————— 4 (1/In2(aA)). 7.10.2 
endian CAAD ER 
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It is necessary to know how g,(g,ayu) approaches its limit gọ(g*,1) in order 
to determine the value of A. 


7.11 Computing renormalization factors using dimensional regularization 


In the previous section, Section 7.10, we computed how the bare coupling go 
should behave as a function of the lattice spacing a. In this section we 
present the corresponding argument using dimensional continuation as the 
cut-off. We do this by treating the defining equation (7.3.10) of B as a 
differential equation to compute g,. Our argument will be valid in any 
asymptotically free theory, like ø? theory in six dimensions or QCD in four 
dimensions. If we let d, be the physical space-time dimension, then we 
regularize by going to a lower dimension d = dọ — €. 

First we compute the relations between lower and higher poles in the 
renormalization. Now we write 


Jo =u g + } dg)e~? | (7.11.1) 
j=1 
and we have the definition of $: 
€g,/2 + B(g,d)ag,/dg =0. (7.11.2) 


Let us expand (7.11.2) in powers of e. The terms proportional to and ° give 
us: 

B = — eg/2 + P(g) = — £g/2 + 3(g0/6g — 1)d (9). 
We have changed notation from our original definitions to correspond to 
the definitions that we used in Section 7.10 for the lattice cut-off. There we 
defined B to be RG coefficient in the cut-off theory, while we defined £ as the 
limit of B as the cut-off is removed. 

Now, the coefficient of the pole e~ + in (7.11.2) is 

3(1 — gd/0g)d,.. :(g) + B(g) Ad ;/dg = 0. (7.11.3) 
This is a differential equation which, when solved using the boundary 
condition d (0) = 0, gives all the higher coefficients d,(g) in terms of the 
single pole d,(g). 

Similar relations (t Hooft (1973)) hold for all renormalization counter- 
terms. The structure is similar to the leading logarithm expansion. They 
show that in each order of perturbation theory the only new information in 
the counterterm in a given order of perturbation theory is in the single pole. 

A convenient way of solving these relations is to work out the solution of 
the differential equation (7.11.2). This gives 


á 1 1 
l udu] = | dg ———_——_ —— Ing, (7.11.4 
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l.e., 


” 2B’) l 
oae T e A E E. E 6 
Jo = T I g% a | i 


The boundary condition g/g > u”? as g +0 has been used. 

We now ask how g, must behave as £ = dọ — d —> 0, with g (and u) fixed. If 
the theory is not asymptotically free (so that in B(g)= — A,g? +7, A, is 
negative), then the integrand has a pole at 

g? = — (dy —d)/2A, + O(d — do}. 

The solution (7.11.4) only unambiguously exists if g? is less than this value, 
which is zero when d = d,. To get to the d = dọ theory with g non-zero we 
must continue g, so that the integration avoids the pole. The result is that go 
has an imaginary part. This, among other things, suggests that the theory is 
unphysical (see Wilson (1973), Gross (1976)). Recently, evidence has 
accumulated that the lattice ¢* theory does not have a non-trivial 
continuum limit — see Symanzik (1982) for a review. 

If the theory is asymptotically free then we may continue (7.11.4) to 
d=d,, i.e., ¢=0. The integrand becomes singular when e=0, and we 
examine the singularity by expanding in powers of g’: 


Ing, =Ing +5Iny 


-5f a EEn na } 
21, 7 \[-9/2+B@)] [—-eg/2-A,9"] 
Serer aa 
[— eg'/2 — A g° TF 


T l A9” 2 
B aiar toe ate . (711.6 
l, [—eg'/2— A,g°] [-eg/2-4,g°] eg ee 


In the first integral we may set € =O and have errors that are o(1). So 


+§} my- — Fen) t i-r |+ oo (7.11.7) 


where f(g) is defined by (7.10.18). Thus 


A, 2 
=ni + ah —In (2) |o} (7.11.8) 
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7.12 Renormalization group for composite operators 


We have seen how a change in renormalization prescription for the 
interactions of a theory can be compensated by a change in the values of the 
renormalized parameters. The same property holds for the composite 
operators we defined in Chapter 6. 

For example, consider the renormalized [ #7] operator in ° theory at 
d = 6. In Section 6.2 we calculated it in the one-loop approximation: 


0|T (x) O(y) $7 ](2)/2|0> = 
= tree graph + {one-loop graph + counterterm graphs} +--- (7.12.1) 


A change in renormalization prescription amounts to a finite change in the 
counterterm graphs. Since the counterterms are of the form 


4[ 67] =4¢7 + 462,67? + 6Z,m?¢ + 6Z,.O¢ + higher order, (7.12.2) 


we have 

al Inewre = 21° Jone + 2al] + bd+cOd. (7.12.3) 
Here a,b, and c are finite quantities of the same order in the coupling as the 
one-loop counterterms. The equation (7.12.3) is, so far, only derived at the 
one-loop order -so the finite counterterms are to be used with their 


operators inserted in tree graphs. 
Let us examine the situation we expect to all orders. We will use minimal 
subtraction. Then the renormalization in the notation of (6.2.12) is 


alL] =37Z,Z2 oo + u??? ZZ m bo + ut? ZZ"? Go. 
(7.12.4) 


Now the bare field is independent of u, so we may write 
we MG?) =Z,2- 492 In(Z,/Z) 
du? a 270 du a 

ate oe a amdon Inl"? ZyZ- 1/2 m?) 
EE Toa "2.2 1/2 E dongin (u? ZZ HR) 

= 1[ġ?] u n(Z,/Z) 
2 du a 

d 

i u’? Amoi me LLT A ! m?) 


d 
PEPS Doug PURA, (7.12.5) 
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which has the form 


d 
Ha 2LO7] = nlg] + yom? wt? + yen? CG. (7.12.6) 


We can formulate this as a matrix equation: 


11 g?] Z2 mZ N ue VAA io? 
h = 


= ‘a Zhe 0 Po = MQ), 
O¢ 0 0 Ze C do 
(7.12.7) 
d dM 
— © =1y——-M'@=y,9. 12. 
fap ag D= Yo (7.12.8) 


The coefficients y, yp, and y, are finite at d= 6. From our calculations in 
Section 6.2, we have 


5g? 5, 
la = 38403 6” 
ke ey (7.12.9) 
b 64r? ? 
Toal 
Y= Zgan T6”? 
where J = g*/64n°. Thus 
3 Am? u’? Apa2- 
Yo= : K n + higher order. (7.12.10) 
0 0 69 


Observe that y,, Yp, and y, are all independent of u and m. This follows from 
the same arguments that we used to prove the same property for y,, and y. 


From the RG equation (7.12.7) we prove renormalization-group equa- 
tions for Green’s functions of the composite operators. For example: 


0 ô 0 
(uz + Pa, = Ya a KO T $(x)$(y)2L $*(2)]|0> 


=(y,—y)<0|T¢¢5[¢7]|0> 
+ (ym? +y. OOIT ely), (7.12.11) 


where we have used udo/du = — yġ/2, and we have set d= 6, thus 
eliminating the p“?~° factors. 
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We must prove (7.12.8) both to all orders for the [@? ] operator and in its 
generalizations to deal with any operators. Since bare operators are 
automatically RG invariant, the only question is whether the anomalous 
dimensions are finite. This is handled by a simple generalization of the proof 
given in Section 7.2 for the ordinary Green’s functions. We will not spell out 
the details — for that is just a mathematical exercise. 
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Large-mass expansion 


A common situation in physics is that in investigating phenomena on a 
certain distance scale, one sees no hint of those phenomena that happen at 
much shorter distance scales. In a classical situation this observation seems 
evident. For example, one can treat fluid dynamics without any knowledge 
of the atomic physics that generates the actual properties of the fluids. 
However, in a quantum field theory this decoupling of short-distance 
phenomena from long-distance phenomena is not self-evident at all. 
Consider an e* — e~ annihilation experiment at a center-of-mass energy 
well below 10 GeV, the threshold for making hadrons containing the b- 
quark. There is, for practical (or experimental) purposes, no trace of the 
existence of this quark. However, the quark is present in Feynman graphs 
as a virtual particle, and can have an apparently significant effect on cross- 
sections. Our task in this chapter is therefore to prove what is known as the 
decoupling theorem. This states that a Feynman graph containing a 
propagator for a field whose mass 1s much greater than the external 
momenta of the graph is in fact suppressed by a power of the heavy mass. 
The physics at low energy is described by an effective low-energy theory 
that is obtained by deleting all heavy fields from the original theory. 
The decoupling of heavy particles is not absolutely universal. One 
important and typical exception is that of weak interactions. Let us 
consider the interactions of hadrons at energies of a few GeV. The effective 
low-energy theory, in the sense just described, consists of strong and 
electromagnetic interactions alone, without weak interactions. So weak 
interactions should be ignorable at low energies. However, it is well known 
that there are in fact many observed processes, particularly decays, that are 
due entirely to weak interactions. The point is that, in the absence of weak 
interactions, these processes are exactly forbidden by symmetries, such as 
parity, charge-conjugation, and strangeness conservation. Weak- 
interaction amplitudes for the processes in question would be power-law 
corrections — suppressed by a power of energy divided by the mass of the 
W-boson — were it not that they are corrections to zero. The consequence of 
this particular situation is that, at low energies, weak interactions are 
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described by a non-renormalizable theory, viz., the four-fermion in- 
teraction. Efforts to find a renormalizable theory led to gauge theories, and 
a prediction of the W- and Z-bosons from phenomena at energies much 
lower than their masses: low-energy phenomena have indeed provided 
clues as to what might happen at much higher energy. 

In this chapter, we will treat the cases where decoupling occurs. The 
theorem that tells us to expect decoupling to occur in many theories was 
formalized by Appelquist & Carazzone (1975) and Symanzik (1973). They 
work with a renormalizable theory in which some fields have masses very 
large compared with the others. They then consider Green’s functions of the 
low-mass fields at momenta much less than the large masses. The theorem is 
that the Green’s functions are the same as those in an effective low-energy 
theory obtained by deleting all of the heavy fields. Corrections are smaller 
by a power of momentum divided by a heavy mass. The sole effect of loops 
of heavy particles is that the couplings of the low-energy theory can have 
different values from those in the complete theory. 

Since the renormalized couplings have no particular a priori value, the 
heavy particles are unobservable until close to threshold. The practical 
importance of the theorem is that one can understand low-energy physics 
without having a complete Lagrangian for all phenomena. 

We will also show how the renormalization group can be applied in the 
computation of the relation between the couplings of the low-energy 
effective theory and those of the full theory. 

There are many ramifications of the decoupling theorem, but we will not 
treat these. One of these is the detailed application of the decoupling 
theorem to gauge theories (see, for example, Kazama & Yao (1982)). 
Another is the large-mass expansion of Witten (1976) — where Green’s 
functions of the heavy fields are computed; this expansion is used is deep- 
inelastic scattering. 

We will also not treat the exceptions to the decoupling theorem. These 
can be treated by the same techniques as those used to prove the decoupling 
theorem itself. We have already mentioned weak interactions as one of the 
typical exceptions. Let us just note two other main classes of exception: 


(1) In theories with spontaneous symmetry breaking, a mass is often made 
large by increasing a dimensionless coupling (Veltman (1977) and 
Toussaint (1978)). The decoupling theorem assumes that a mass is made 
large by increasing dimensional parameters. 

(2) Some dimensionless couplings needed by power-counting violate 
renormalizability of the low-energy theory (see Collins, Wilczek & Zee 
(1978)). 
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In any event the effective low-energy theory is non-renormalizable. 

It might be supposed that since General Relativity is non-renormalizable 
in perturbation theory, it contains some clues to phenomena at very high 
energies (see, for example, Hawking & Israel (1979)). 


8.1 A model 


We will restrict our attention to a very simple model. It is a f° theory with 
two fields in six space-time dimensions: 

L = (p) /2 + (Ag,)"/2 — m*p;/2 — M*g,/2 

— p’ 971 g,6;/6 + g2¢,04/2] — u? > fd, + counterterms. (8.1.1) 
Symmetry under ¢, — — @¢,, has been imposed to cut down the number of 
possible couplings. Then (8.1.1) contains all couplings necessary for 
renormalizability. We assume that the renormalized mass, M, of the heavy 
field is made large while all other parameters are held finite. The factors of 
the unit of mass u needed with dimensional regularization are explicitly 
indicated. 

All our techniques can be readily extended to treat more complicated 
(realistic) theories. 

As usual we have introduced a linear term in the Lagrangian to cancel the 
tadpole graphs. This is determined by the renormalization condition that 
<0/¢,|0> =0. 

The remaining counterterms can be put in the form 

L a = (Z, — 1)07/2 + (Z, — 1)0 G4 /2 

—[m(Z,, — 1) + M?Z,,y|o7/2 — [M7(Zy — 1) + m’ Z uml Pr/2 

— p’ -HE CET: — 91); /6 + (G23 — 92), >4/2] 

— pt? fo -SQ (8.1.2) 
As usual, we may choose the dimensionless renormalizations (viz., the Z’s 
and the gp’s) to be independent of the dimensional parameters m?, M?, 
and f. 

The decoupling theorem asserts that phenomena on energy scales much 
less than M are described by an effective low-energy theory whose 
Lagrangian has the form 


L ae = 20/2 — m*?2 7/2 — p? ~t g*27!7 3/6 — uz f *g, 
+ counterterms 
= Cp*? /2 oe m** *? /2 _ p> 4/2g* b*3 /6 A Ta a 
+ counterterms. (8.1.3) 
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Here we have defined a scaled field @* = z’/*,. We will prove that g*, m* 
and the coefficient z can be chosen so that Green’s functions of ¢, obtained 
from &,,, differ from those obtained from the full Lagrangian (8.1.1) by 
terms which are of the order of a power of external momenta divided by M. 

It is usually convenient to work with the scaled field @* which has unit 
coefficient for its kinetic term in the basic Lagrangian. Then Green’s 
functions in the full theory are related to Green’s functions in the low- 
energy theory by 


Gy(Dy5-++5 Pri 9v In M, M, p) = (O|TH,(P1)---Fi(Pw)|0 > fun meory 

=z "?GR(D,,..-.Pyig*sm*, u)[1 + O(1/M°)] 

=z7M2<0|T*(p,)...ġ*(p)|O>[1 + O(1/M%)], (8.1.4) 
as M > œ with p,,..., py fixed. The fractional errors go to zero as a power 
of M times logarithms; the power is typically M~?. We can therefore use 
1/M°, with a slightly less than two, to bound the error. As is our convention, 
the tilde signs over the fields and Green’s functions indicate a Fourier 
transform into momentum space. 


8.2 Power-counting 


In this section, we will establish the rules for finding the leading power of M 
in the value of a graph as M — oo. These form a simple generalization of 
Weinberg’s theorem, and will involve us in understanding which regions of 
momentum space are important. We will mostly be interested in graphs for 
the Green’s functions of the light field ¢,. Our aim will be to find those 
graphs that contain lines for the heavy field and that do not vanish as M 
goes to infinity. 


8.2.1 Tree graphs 


Because we choose to impose the symmetry ¢, > — p the only tree graphs 
containing lines for the heavy field have heavy external lines. An example is 
Fig. 8.2.1. Since all momenta on the lines are fixed, and since the free ¢,- 
propagator is 


i/(p* — M°) ~ —i/M?, 


< 


Fig. 8.2.1. A tree graph with a heavy line. 
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the behavior of any given tree graph as M —> œ is 
M77 (8.2.1) 


where H is the number of heavy lines. (We use the natural terminology of 
calling a line of a Feynman graph heavy or light according to whether its 
free propagator is for ¢, or ¢, respectively. Our graphical notation is that 
heavy lines are thicker than light lines.) 


8.2.2 Finite graphs with heavy loops 


Consider now a graph that has one or more loops but no ultra-violet 
divergences or subdivergences, and that has some heavy internal lines. The 
lowest-order example is Fig. 8.2.2(a), for the four-point function. Its 
external momenta (if small) may evidently be neglected on the lines of the 
loop, whose value is then 


4 1 ‘nt 
92 [irg a aee (8.2.2) 


2 (2m) (k= M? ~ 384r? M? 
(We label the symbol T by the figure number.) 
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Fig. 8.2.2. Large-mass behavior of graph without an ultra-violet divergence. 


The graph vanishes as M — œ. The precise power of M can be obtained 
by considering the possible regions of momentum space (after Wick 
rotation), as follows. Any region of k that is finite as M —> œ gives a 
contribution of order M~°. Since the graph is UV finite, the only other 
possibility is k = O(M). Simple power-counting gives M~?, as found in 
(8.2.2). This power-counting is the same as for the UV degree of divergence. 

The graph is negligible (by a power of M?) compared to graphs with no 
heavy lines. If, nevertheless, we wanted its leading contribution, then it 
would be effectively the local four-point vertex symbolized in Fig. 8.2.2(b). 
The non-renormalizability of this coupling (when the space-time dimension 
is six) is tied to the negative power of M7’. 


8.2.3 Divergent one-loop graphs 


Consider the logarithmically divergent vertex graph in Fig. 8.2.3(a). After 
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Fig. 8.2.3. Large-mass behavior of graph with an ultra-violet divergence. 


minimal subtraction the loop gives 


iq? 1 1-x 
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The same power-counting as for Fig. 8.2.2 confirms the power M° for the 
leading behavior as M > œ. There is also a logarithm. This occurs because 
two regions contribute to the leading behavior: the first is where the loop 
momentum k is of order M. The second region is the UV region where 
k — oo. After subtraction of the ultra-violet divergence a finite contribution 
remains. 

Evidently the graph gives a contribution that increases with M. 
Fortunately the non-vanishing part of the loop is independent of the 
external momenta. So for large M, the loop is effectively a three-point 
vertex, as Shown in Fig. 8.2.3(b). A proof which generalizes to higher order is 
to differentiate with respect to any external momentum. Since the 
differentiated graph is finite, it vanishes when M — œ, like a power of M. 

Recall our statement of the decoupling theorem, that at low energies, we 
could calculate Green’s functions from the effective low-energy Lagrangian 
(8.1.3). The result of our calculation of Fig. 8.2.3 is that the graph generates 
an extra piece in the ¢; coupling of the low-energy theory. Let us therefore 
write 


g? M* 
3/2 k — _ l SEEE 5 LA 
zg" =g; ERIS a(i) [+00 ). (8.2.4) 
We may drop the graph Fig. 8.2.3 and replace it by the order g? term on the 
right of (8.2.4). The loop has been replaced by a local vertex where all the 
lines come to a single point. This corresponds to the fact that the internal 
line is far off-shell and can only exist for a short time. 
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Fig. 8.2.4. Large-mass behavior of graph with a quadratic ultra-violet divergence. 


The self-energy graph, Fig. 8.2.4, gives a leading term of order M’. The 
value of the ais is 


RP.) = Tlo- WM -pp 


! a a, ae 
+ dx[M? — p*x(1 -oja “P| 
0 


An 


ig? fa M? \]_ p ( M? 
7 Nel otal =) Seo in 
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p 
ip o( fa) (8.2.5) 


Again the loop momentum k can be either UV or of order M to contribute, 
so there will be at most a single logarithm of M?/y?. Since we have to 
differentiate three times with respect to p, before obtaining a convergent 
graph, the non-vanishing terms, as M > oo, are quadratic in p. From the 
effective Lagrangian (8.1.3), we see that the graph may be pupae by a 


contribution to the basic self-energy vertex i[(z — 1)p* —(m**z — m’)] in 
the low-energy theory, with 
2 2 
g2 M 4 
=1— =~] y+ In| —> } |+ 0(9°), 8.2.6 
: reek i o( >) | (g) l 
2 2 2 
ear ia a aoe 
zm m A? + In qe + O(g”). (8.2.7) 


We can now compute g* and m*: 


2 1 2 
93(92 — 491) M 
g* =g,- A A ore : [y+ 1n( 2) |+ 00%, (8.2.8) 


2 M2 
er ee g2 2 n 
m m fis M £ +in( a) 
M? 
al 4 
m | y+in( He ) [+00 (8.2.9) 


Notice that there is a contribution of order M? to the self-energy and 
gy 
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hence to m*?. In order to keep the physical mass of @, finite and hence keep 
m* finite as M > œ, we must let m?” have a term proportional to g? M? (with 
higher-order corrections): 


Any? 


On expanding m*? in powers of coupling, we find 


M? M? 
m? = finite + e|? —1+In (a) | + higher order. (8.2.10) 


m** = finite term in m? 
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Since m is the mass parameter for the light field, it is generally considered 
unnatural to have to fine-tune it within a fractional accuracy of m*?/M?, as 
is required by (8.2.10), to obtain a finite value of m* when M — œ. In the 
context of grand unified theories this is called the gauge hierarchy problem 
(Weinberg (1974, 1976), Gildener & Weinberg (1976)). It is hoped to solve it 
by finding a phenomenologically sensible theory with no need for fine- 
tuning. 


8.2.4 More than one loop 


We may have one of the divergent one-loop graphs occurring inside a 
larger superficially convergent graph. A typical example is Fig. 8.2.5. When 


k l A(g*z?/?) 


Fig. 8.2.5. Large-mass behavior of two-loop graph with an ultra-violet divergence. 


M> oo with the external momenta fixed, the only region of the loop 
momenta that gives a non-zero contribution is where the outer-loop 
momentum | is finite and the inner-loop momentum k is of order M or 
larger. So the heavy loop can be replaced by its effective low-energy vertex 
computed at (8.2.3). This procedure does not change the overall degree of 
divergence. 

The situation at higher order or with overall-divergent graphs is more 
subtle as we will now see. The graph of Fig. 8.2.6 is typical. Now, it contains 
a subgraph, consisting of the heavy loop, which we have already considered. 
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Fig. 8.2.6. Large-mass behavior of another two-loop graph with an ultra-violet 
diver gence. 


Therefore, the low-energy theory contains a graph where the heavy loop is 
replaced by a vertex using (8.2.4) for g*z°*/*—g. This graph exactly 
reproduces the region where k is finite and lis large for Fig. 8.2.6. We add 
and subtract this graph from the original graph as indicated in the figure. 
The subtracted term (in square brackets) has a vanishing contribution from 
finite k (as M > œ). So we replace it by an effective vertex As. The same 
arguments as we used for one-loop self-energy, Fig. 8.2.4, show that it has 
three terms, proportional to p°, m*, and M?, with coefficients polynomial in 
In (M?/7). 

In this and in other graphs there are UV divergences for the whole graph 
and for subgraphs. Implicitly, the counterterm graphs are to be included. 
Provided we use mass-independent renormalization we are guaranteed 
that the counterterm graphs satisfy the same power-counting as the original 
graphs. In particular they are polynomial in the light masses. Thus the 
counterterm graphs do not change the power-counting and differentiation 
arguments that are crucial to our work. 


8.3 General ideas 


Structurally, the arguments in the last section appear similar to those we 
used in Chapter 7 to show that renormalization-prescription dependence 
can be compensated by finite counterterms. In fact, as we will see in the next 
section, Section 8.4, a proof of the decoupling theorem can be constructed 
exactly by changing the renormalization prescription. We will show that a 
renormalization prescription can be chosen to have a number of convenient 
properties, the most important of which is that the low-energy theory is 
constructed simply by deleting all heavy fields without changing the 
couplings and masses of the light fields. This property is called manifest 
decoupling, and we will explain it with the aid of an example in 
subsection 8.3.1. 
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Our approach follows the method given by Appelquist & Carazzone 
(1975) and Witten (1976). This approach generates the effective theory as a 
series of subtractions. The simplest non-trivial case is given in Fig. 8.2.6. 

There is another approach due to Weinberg (1980) in which the 
decoupling is considered by first integrating over the heavy fields in the 
functional integral. (See also Ovrut & Schnitzer (1980).) This method is less 
convenient for treating graphs like Fig. 8.2.6, so we do not use it. 


8.3.1 Renormalization prescriptions with manifest decoupling 


Suppose we used BPH(Z) renormalization instead of minimal subtraction. 
Then the renormalization condition is that the terms up to p*”” are zero in 
the Taylor expansion of a graph IT about zero external momentum. Here 
6(T) is the degree of divergence. For a graph with a single loop, consisting of 
a heavy line, these terms are precisely those that are non-vanishing as 
M > œ. Examples are given by the graphs of Figs. 8.2.3 and 8.2.4. 

In fact, for a general graph, the effective low-energy theory in this 
renormalization presciption is obtained merely by deleting all graphs 
containing heavy lines, together with all their counterterm graphs. The 
values of the couplings and masses are not changed. Therefore the BPH(Z) 
prescription has the property we called ‘manifest decoupling’. It might 
appear sensible always to use a renormalization prescription that has this 
property. However, for many purposes it is useful to use other re- 
normalization prescriptions, e.g. minimal subtraction and its relatives. 
Particular cases are theories containing massless fields, especially non- 
abelian gauge theories, and theories with spontaneous symmetry breaking. 
In any case, it is good to have a direct method of proof of decoupling that 
can work with any prescription. Furthermore, a prescription like minimal 
subtraction is more convenient if one also wishes to compute high-energy 
behavior (Section 7.4) with the aid of the renormalization group. In fact, the 
method we will use will start from a mass-independent renormalization 
prescription defined for both the full theory and for the effective low-energy 
theory. Then the renormalization of the low-energy theory is extended to a 
renormalization prescription of the full theory in such a way as to satisfy 
manifest decoupling. This method was first stated by Collins, Wilczek & 
Zee (1978). 

One renormalization prescription that gives manifest decoupling at low 
energies and that allows the use of renormalization-group methods at high 
energies is due originally to Gell-Mann & Low (1954). In this scheme, one 
makes subtractions at some arbitrarily chosen value of momentum. This 
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scheme was applied to the large-mass problem by Georgi & Politzer (1976). 
The disadvantage of this scheme, compared with the scheme that we will 
actually use, is that renormalization-group coefficients are explicitly 
functions of M/p, and of m/: 


By = By(91,92;M/p, m/p). 


This makes calculations complicated. Furthermore, this scheme obscures 
some symmetries. 


8.3.2 Dominant regions 


Before actually constructing a proof of the decoupling theorem, let us give a 
precise statement of the regions that give unsuppressed contributions (i.e. 
not suppressed by a power of M7’). We consider each graph in the full theory 
together with the set of subtraction graphs needed to cancel its divergences 
and subdivergences. We do not consider the subtraction graphs separately. 

First of all, any graph with no heavy lines at all contributes without 
suppression. 

A graph with one or more heavy lines cannot give a contribution unless 
at least one loop momentum is of order M. The contribution to a graph 
when M is large can be considered as the sum of contributions from various 
possible regions of momentum space. The regions can be specified by the 
sizes of the loop momenta. For our purposes, it is enough to classify a 
momentum as either finite or large. ‘Large’ we define to mean ‘of order M or 
bigger’. We can do power-counting for each region in the obvious way. For 
the loops carrying large momenta, counting powers of M is the same as for 
the ultra-violet degree of divergence. This gives a factor Mê, where ô is the 
ultra-violet degree of the lines carrying large momenta. A heavy line 
carrying finite momentum counts as M~?. A light line carrying finite 
momentum counts as M°. 

The leading power of M for a graph is obtained as the maximum of the 
powers for the possible regions. Let us define 6,,(I’) to be this highest power. 
In general there will be logarithmic enhancements. But Weinberg’s theorem 
guarantees that the power 6,,(T) is correctly given by considering only the 
regions we have listed. The graphs treated in Section 8.2 provide examples 
of this procedure. (The subscript ‘M’ is to distinguish 6,,(I-) from the ultra- 
violet degree of divergence.) 

A region contributing to a leading power that is M? or bigger is 
symbolized by contracting to a point both the heavy lines and the lines 
carrying large momentum. The points represent vertices in the effective 
low-energy theory; we have already used this notation in Section 8.2, in the 
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Fig. 8.3.1. Graph with two contracted Fig. 8.3.2. One-particle reducible sub- 
subgraphs. graphs may have to be contracted. 


figures. In general (see Fig. 8.3.1) the contractions will result in several 
vertices. We include in our definition the restriction that a subgraph is only 
contracted to a point if it contains at least one heavy line. A contracted 
subgraph is 1PI in the light lines; for if it can be split into two parts by 
cutting a light line then that line is not carrying a large loop momentum. 
However, the contracted graph may be 1PR in the heavy lines. For 
example, in a theory where the symmetry ¢, — — Q, 1s not valid, a graph 
like Fig. 8.3.2 gives a leading power M°; the self-energy gives a power M? 
which cancels the 1/M? in the propagator. 

A subgraph that is contracted to a single vertex gives the same power of 
M as its UV power-counting. So 


dim (subgraph) = power of M + dim (couplings). 


Hence in a renormalizable theory (where couplings have non-negative 
dimension) the only contracted graphs that have a non-vanishing value as 
M > œ correspond to vertices whose couplings have non-negative dimen- 
sion. These vertices give the difference between g*z°’? and g, etc. Thus the 
couplings in the effective low-energy theory satisfy the dimensional 
criterion for renormalizability. In a scalar theory, this implies actual 
renormalizability, provided all the couplings are used that have non- 
negative dimension and that obey the symmetries of the full theory. 


8.4 Proof of decoupling 


8.4.1 Renormalization prescription R* with manifest decoupling 


Let us work with the theory defined by (8.1.1). We choose to renormalize 
it according to a mass-independent prescription, which we will denote 
by a symbol R. For definiteness we choose this to be minimal subtraction. 
By deleting all heavy fields from (8.1.1) and by changing the values of the 
couplings we obtain the form of expected low-energy theory (8.1.3). We 
choose to renormalize the low-energy theory by a mass-independent 
prescription R*, which we also take to be minimal subtraction. 

Our proof will consist of extending R* to a renormalization of the full 
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theory. The extension will satisfy manifest decoupling. Since different 
renormalization prescriptions differ only by a reparametrization, the 
statement (8.1.4) of the decoupling theorem will hold. The structure of R* in 
the full theory will give a ‘mass-independent’ form for g*, m*? and z: 


g* = 9*(91,92,M/w), 
Z=2(91,92,M/p), (8.4.1) 
m*? = m?z(9159>> M/w) T M? znm(91:92; M/H). 


Mass independence means independence of the light mass. As before, to 
save notational complication we choose to renormalize the linear coupling 
fo, by the prescription that <0|ġ,|0> =0. We then ignore both the linear 
coupling and the tadpole graphs. 

The reason we use mass-independent renormalization prescriptions for 
all the couplings other than the term linear in ¢ is that we can thereby make 
very clear the decoupling of phenomena at small mass scales from large 
mass scales. In addition, the renormalization-group equations for (8.4.1) are 
much simpler to work with than they would otherwise be. 

It is convenient to define two concepts: 


(1) A heavy graph is one containing at least one heavy line (i.e. a line for the 
heavy field @,). 
(2) A light graph is one that contains no heavy lines. 


For each basic graph T in the full theory we have a series of counterterm 
graphs that are used to cancel its divergences. If I is a heavy graph, then we 
also consider its counterterm graphs to be heavy graphs, even though they 
may contain no explicit heavy lines. 

We have chosen a renormalization prescription R* for the low-energy 
theory. This defines the renormalized value of any graph in the low-energy 
theory, and therefore of any light graph in the full theory. We now wish to 
extend this prescription to heavy graphs, in such a way that it satisfies 
manifest decoupling. That is, the renormalized value R*(T) of a heavy graph 
goes to zero as M > œ. The basic idea is to subtract such graphs at zero 
momentum. That this is a sensible procedure is easily seen by examining a 
few of the graphs from Section 8.2. 

For example, we saw that Fig. 8.2.3 diverges logarithmically when 
M > œ, if we use minimal subtraction. But with zero-momentum sub- 
traction we have 


i 3 1 1-x 25. 2 p2xy 
td al a ce -a 
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= O(p?/M*) asM-o. (8.4.2) 
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- Clearly, the difference between the two renormalizations is just the 
difference given by (8.2.4) for g*z*/* — g. At high energy we would use 
minimal subtraction — so M can be neglected compared with momenta — 
but at low energy we would use the R* prescription —so that we can 
simplify calculations by dropping heavy graphs. 


a aon 


Fig. 8.4.1. 


Consider next Fig. 8.4.1 for the self-energy of the heavy field. This graph 
contains both light and heavy lines. It would behave like M? In(M) for large 
M, if we used minimal subtraction. Instead, let us define the subtraction by 
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= 0(1/M?) asM>œ. (8.4.3) 
Here, we observed that when M — œ the dependence of the unrenormalized 
graph is linear in m? and p°. So we expanded about p=m=0 and 
subtracted the terms up to quadratic in m and p. This means that the 
counterterms are polynomial in m, i.e., ‘mass-independence’ holds good. 
Normally subtractions at zero mass and momentum have infra-red 
divergences, but the presence of a heavy line prevents this here. 

As a final example let us examine the two-loop graph of Fig. 8.2.6. The 
unrenormalized graph cannot be expanded about m=p=0 to give a 
counterterm, because there are two light lines. At m, p ~ 0 they give m- and 
p-dependence of the form 


(value of heavy loop at p =k =0) 2 2 
dé ow, (pp? + m?) In +m’). 
= i (k? — m*)[(p +k) — m°] ES i 
(8.4.4) 


The right-hand side of this equation is schematic and symbolizes the 
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(a) (b) 
Fig. 8.4.2. Counterterms for Fig. 8.2.6. 


maximum powers and logarıthms of m and of p that occur. However, to 
obtain the renormalized value of the graph we must first subtract 
subdivergences by the counterterms Fig. 8.4.2, constructed by the R*- 
scheme. Now, the counterterm graph (b) has infra-red behavior exactly 
equal and opposite to that of (8.4.4), because the counterterm is minus the 
value of the heavy loop at p = k =0. Thus the sum of the two graphs has the 
extra convergence we need. The overall counterterm is then linear in p? and 
m°. There are no logarithms of m as M > œ. 


8.4.2 Definition of R* 


To define the renormalization prescription R* in general, we simply 
summarize and generalize what we have just done for particular graphs. 

We define the renormalization prescription R* in the full theory to be the 
same as our chosen prescription for the low-energy theory whenever it acts 
on a purely light graph. For a heavy graph I’, we assume inductively that we 
have defined the quantity R*(I) in the usual way to.be the unrenormalized 
value of I plus counterterms in the R*-scheme to cancel its subdivergences. 
If F has degree of divergence ô(T)> 0, then its overall counterterm is 
defined by subtraction at m =p=0. The renormalized value of I" is 
R*(T) = R*(T) + C*(T), as usual. 

To define C*(I) precisely, we first expand R*(I) in a Taylor series about 
the point where its external momenta and the light mass m are zero. Pick 
out the terms where momenta and m° occur with dimension up to 6(I), and 
let the counterterm C*(I) be the negative of these terms. Our examples tell 
us to expect that with such a counterterm: 


(1) the leading M — œ behavior is canceled, 
(2) there are no IR singularities in the counterterm. 


We must prove these statements in general. The proof will generalize from 
the simplest non-trivial case, Fig. 8.2.6. There, the UV divergent 
unrenormalized graph is not polynomial in mand p, but after subtraction of 
subdivergences by the R*-scheme, it becomes polynomial. Then the R*- 
prescription can legitimately generate the overall counterterm. Moreover, 
after subtraction of the subdivergences, the leading large-M behavior is also 
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polynomial in m and p with degree equal to the degree of divergence, so that 
it is cancelled by the overall counterterm. 

Even with the subtractions for subgraphs, there are in general IR 
singularities in the Taylor expansion of a graph. For example, consider 
Fig. 8.4.1 and expand its integrand — see (8.4.3) — in powers of m? and p°. All 
the terms beyond the second give divergences at x = 0; it is only the terms 
needed to cancel the UV divergence that are non-singular. 


8.4.3 IR finiteness of C*(T) 


Suppose I" is a heavy graph, 1PI in its light lines. Potential infra-red 
divergences in C*(I) arise when m and the external momenta are made 
small. They come from regions where some or all of the loop momenta are 
of order m. The simplest case is where all the internal momenta are of order 
m. 

If T were a light graph, we would obtain a contribution of order m’°™, 
where 6(I) is the UV degree of divergence. So let us call — d(T) the 
canonical IR degree of divergence of I. If ô(T) = 0, this is a logarithmic 
divergence. If 6([) > 0, then the graph is finite as m—0. But to get the 
coefficients of the polynomial counterterms we differentiate up to ô(T) 
times with respect to m and the external momenta. The highest terms in the 
polynomial are therefore always logarithmically IR divergent, for a light 
graph. 

However, I is actually a heavy graph. So at least one of its propagators 
counts as 1/M7? instead of 1/m?. Thus all the counterterms have an IR finite 
contribution from this region, where all its loop momenta are small. 

This discussion is sufficient for all one-loop graphs. But multi-loop 
graphs have IR divergences coming from regions where only some loops 
have small momenta. For example, Fig. 8.2.6 has a divergence from the 
region where p and k are small, i.e., order m, and | is finite or large. This 
corresponds to IR degree — 2, and is given by (8.4.4). As we saw, the IR 
divergence is canceled by the graph with a counterterm for the heavy loop. 

The general case is that some light lines carry momenta of order m and 


(a) Fig. 8.4.1, with k and p of order m ___, fe 


(b) Fig. 8.2.6, all loop momenta large ——» ——e—— 


(c) Fig. 8.2.6, k and p of order m — > nOs 


Fig. 8.4.3. Examples of reduced graphs. 
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the remainder of the lines either are heavy or carry large momentum. Each 
such region is symbolized by a reduced graph in which the subgraphs 
consisting of the lines with large momenta and of the heavy lines are 
contracted to points. Examples of reduced graphs are shown in Fig. 8.4.3. 
Note: 


(1) Counterterm graphs can also have infra-red divergences. The counter- 
terms are inside the vertices of the reduced graphs. 

(2) All lines of reduced graphs are light, so at least one vertex of a reduced 
graph corresponds to a heavy subgraph. 


We can write the infra-red degree of divergence for the region cor- 
responding to a particular reduced graph y as 


Op (Ty) = — ôl) + J [ô(V) + IR degree of R*(V)]. (8.4.5) 
a 

The meaning of this equation can be seen from an example. Consider 
Fig. 8.2.6 when k and p are of order m. If the graph were purely light, we 
would have IR degree equal to — 2, which is the negative of the UV degree. 
This would imply that the m? and p* terms in the expansion about m = p = 0 
would be divergent. However, the single reduced vertex — as illustrated in 
Fig. 8.4.3(c) — has acounterterm. This counterterm ensures that the vertex’s 
value is of order m?/M7? instead of m°. The IR degree for the whole graph is 
thereby decreased by 2. The second term in (8.4.5), where the sum is over this 
single vertex, indicates this reduction. The degree for the region is then — 4; 
we can therefore expand up to order m? and p° without an infra-red 
divergence. The terms of order m4, p*, etc., are infra-red divergent, but they 
are not needed for ultra-violet renormalization. 

In the general case of (8.4.5), each reduced vertex V would contribute 
— 6(V) if it were light and all its internal lines had momenta of order m. But 
it actually contributes what we will now prove is a smaller amount. 
Remember that counterterm graphs also contribute, and we assume that 
counterterm vertices are included inside reduced vertices. The IR degree of 
R*(V) is its power as its external momenta are scaled like m. The possible 
cases for V are: 


(1) If V is overall convergent and contains a heavy line, then its infra-red 
degree is greater than its ultra-violet degree. Fig. 8.4.3(a) has a vertex 
with UV degree — 2 and IR degree zero. 

(2) If V is overall divergent and contains a heavy line then ordinarily we 
would expect it to behave as m? when m —> 0 with fixed M. But we make 
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subtractions by the R* scheme so that its behavior is actually m°” * 7. 


By induction we may assume its subtractions have no IR divergence. 


Hence, in every region of momenta a heavy graph T always has at least 
one mechanism to reduce its IR degree below — 6(I) and none to increase it. 
Thus the overall counterterm C*(T) is IR finite. It is crucial to our inductive 
proof that we first subtract subdivergences by the R* scheme. 


8.4.4 Manifest decoupling for R* 


A purely light graph is a graph in both the full theory and in the low-energy 
theory. It survives unaltered when we let M — œ. We will now prove that all 
the heavy graphs vanish when M > œ, given that we renormalize them by 
the R* scheme. 

To do this, decompose each heavy graph into its skeleton, i.e., a series of 
1PI graphs connected by lines that are not part of any loop. Since a heavy 
line that is outside a loop vanishes as M — œ, all heavy graphs vanish as 
M > oo, if the 1PI graphs vanish. 

The M > œ limit of a 1PI graph can be related to an IR limit by scaling 
all masses and momenta: 

M-1, p->p/M, m-m/M. 
Then 
T'(p,m, M) = M’PT(p/M,m/M, 1), (8.4.6) 
where d(T) is the dimension of I. So I vanishes as M > œ provided the 
infra-red behavior is less singular than m~“"”. But this is what we showed in 
the proof of IR finiteness of the counterterms. (Note that the dimension of a 
graph is greater than or equal to its UV degree of divergence.) 


8.4.5 Decoupling theorem 


We have constructed two renormalization prescriptions, labelled R and R*, 
for the theory under consideration. The Green’s functions in the schemes R 
and R* are equal provided we make appropriate changes in the parameters: 


A> 9", 
In Gn» 
coefficient of 0¢?/2 >z, 
coefficient of 0662/2 > z,, (8.4.7) 
m? + m*?, 
M? > M*?, 
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This is just a particular case of a renormalization-group transformation, 
and is proved by Section 7.2. When M —> œ we may drop all heavy graphs 
in the R* scheme (so also g¥, z,, M* drop out of consideration). This then 
gives (8.1.4), which is the decoupling theorem. 

Mass-independence is true because we have arranged all counterterms to 
be polynomials in the light mass of the appropriate degree. 


8.5 Renormalization-group analysis 


When one computes a graph containing lines for fields with widely different 
masses, one finds, in general, that its value gets large as a power of the 
logarithm of the mass ratio. Such large coefficients are undesirable in a 
perturbation expansion, for they mean that the reliability of using a few 
low-order terms is worsened. This situation arises in both strong- and 
weak-interaction physics. We will now show how to combine the decoup- 
ling theorem and the renormalization group to do calculations without 
their being made unreliable by the large logarithms. 

A convenient method is to use a mass-independent scheme (specifically 
minimal subtraction) for high-momentum calculations, where one often 
wishes to neglect all masses, and to use the R* scheme, as defined in 
Section 8.4, at low momenta, where one wishes to neglect heavy graphs. An 
advantage of this method is a simplification of many of the calculations 
needed to match high-energy and low-energy calculations. One needs only 
the pole parts of graphs and the values at zero external momentum. 

We will explain how to use this scheme in the toy theory (8.1.1). First let 
us write the RG equations for the Green’s functions. For a Green’s function 
of N, light and N, heavy fields, we have 


d 
(uz +3Niy + Nn) Gyn, = 0, (8.5.1) 


where 


d ð ĝ ĝ 
— = u— + : aces esis 
ET ET PACH DFP PACH 92), 


ð 0 
— (M?Yu + m7y,4) —(my,,+ M? Ym) (8.5.2) 


ðM? ôm? ` 
The RG coefficients are obtained from the renormalization counterterms as 


usual. Their lowest-order values are 


1 
1287? 


B,=- S9i+9:93+93)+°°:, 
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1 
p,=- [ogg 492 + 9193-69192) +, 


o l 
~ 19273 


yi g+, 


(gi + 93)..-. (8.5.3) 


Yn 38473 
In the effective low-energy theory, the RG equation is 
d* 
ce + iny (os = 0, (8.5.4) 
u 
with 


To compare the low-energy theory and the full theory, we extended the 
renormalization scheme of the low-energy theory to a renormalization 
scheme R* for the full theory. In this scheme the RG operator has the form 


d* 0 0 0 
ee pie ee pee 


ð 
— (m*?y* + M*?y* )——, (8.5.6) 


— (M*?y%, + m*?yž u) maT? 


ô 
ôM*? 
and the anomalous dimensions of the fields are y* and y*. In fact 8*, y*, and 
y* are identical to those in the low-energy theory (see (8.5.4) and (8.5.5)), 
while B$ = yž* u = yin, = y% = yf = 0. This is easily seen by examining the 
Green’s functions which provide the normalization conditions for the 
renormalizations. 

For example, consider the inverse of the heavy propagator when both p° 
and m? are much less than M?: 


1/Go 2 = — ilp? — M*? + O(p%, m? p?, m*) | (8.5.7) 
which satisfies 
d* = —1 
Hae 20.2 = yi Go.2. (8.5.8) 


This is only consistent if y* = y% = y*ų = 0. 
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8.5.1 Sample calculation 


We wish to start with the full theory renormalized by minimal subtraction. 
In that version of the theory, we know the evolution of the couplings. Our 
aim is to compute Green’s functions in the low-energy theory and the values 
of the mass and coupling. The low-energy effective couplings are 


128n° 
REC) Sees ee er at ek 
g**(u) Zin u/A*) PER 
g* = fixed, (8.5.9) 
m*? = constant [1n (u/A*)] ~ +°, 
M*? = fixed. 


The effective couplings for the full theory with minimal subtraction are 
more complicated because they solve a coupled equation for two variables. 
To make the transition between the schemes we compute the lowest- 
order divergent graphs. We equate the self-energy for ¢, in the two schemes, 
with use of the Lagrangian (8.1.3) for the low-energy theory. This gives 


Fig. 8.2.4 + pole counterterm 
= Fig. 8.2.4 + zero-momentum counterterm + i(z — 1)p? — i(m*?z — m°). 


We thus obtain z and m*? as given by (8.2.6) and (8.2.7). To keep m*? finite, 
and not of order M?, we must replace m? by 


1 


Notice the presence of logarithms of M/y. If they are large enough, they 
invalidate the use of perturbation theory to compute g*, m*?, and z. 
However, the equations we write are valid at any value of u, so we may 
perform the calculations with u of order M. After computing g*, m*, and zin 
terms of g,,g,,m, and M, we can evolve them to the value of u that we wish 
to use for calculations in the low-energy theory. 

A convenient point to do the matching is where g, = g*, i.e., at u* = pĝ, 
where uê = M’e’/4n. Then for a general value of u we have 


*2 = e a 
PW) = TaZ) + 6/1281) H/m) H 

A similiar equation holds for m*: 
m*? (u) = m [(1287?/3g?) + In (u/uo) +] +2. (8.5.11b) 


The solution for z is more complicated since the renormalization-group 
equation for both renormalization prescriptions is needed. 


(8.5.11a) 
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8.5.2 Accuracy 


We compute g*() in the low-energy theory by matching to the full theory 
at some p, of order M and then evolving to an arbitrary renormalization 
mass u from uo. Given the accuracy in g*(u) that we need for a particular 
calculation, we will find the order to which we must perform the matching 
and to which we must know $. 

The RG equation for g*(u) gives 


glu) 


In (u/Ho) = | dg’/B(9’). (8.5.12) 


g(Ho) 
So if there are small errors Ag(u,) and A(1/B) in g(u,) and 1/B then the error 
in g() is 


gluo) ' : 
Ag(u) ~ Bla Wl eT ad alaan. 


Z ofaw h aes - [eo aa jl} 


Suppose we perform matching up to n„-loop order; then the error in g()) is 
of order g(u,)’""* °. Suppose $ is computed to ng-loop order; then the error 
in 1/B(g) is of order g?"*~ >. These translate to errors in g(y) of order 


g(t)? gluo) ”” 
and 


glu)’ In [g(u)/g(uo) ] if n, = 1, 
or 


O[g(u)**?"*] + O[ g(u)? gluo) Y] ifn 22. 


For example, if we wish to perform reliable two-loop calculations, then 
we need Ag to be much smaller than g°. This means that we need to do the 
matching correct to one loop and that the B-function is needed to two loops. 
This is the minimum accuracy needed to correspond to a fractional error on 
A (defined in Chapter 7) which is much less than unity. 
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Global symmetries 


In this chapter we consider the impact of global symmetries of a field theory 
on its renormalization. As an example consider the theory of a charged 
scalar field: 


L = dp' 0p -m o'p — gop) /4. (9.0.1) 


This classical Lagrangian is invariant under the transformation ¢ => e~ 29., 
The quantum theory is also invariant. For this particular theory, the 
quantum invariance is not a very deep statement. However, symmetries do 
not always survive quantization, as we will see in Chapter 13. Thus it is 
useful to examine the consequences of the symmetry in this theory. One 
consequence is that only invariant counterterms are needed; for example, 
we do not need to use non-invariant counterterms proportional to 


p? a pr? or ilo? a p'?). 


Other consequences are the W rd identities, which characterize the action 
of the symmetry at the level Of Green’s functions. 

The main step in proving the statements is to impose an ultra-violet cut- 
off. If this is done by putting the theory on a lattice or by using dimensional 
regularization, the symmetry is preserved. The arguments given in Section 
2.7 are sufficient to prove Ward identities in the bare theory. From the 
invariance of Green’s functions follows invariance of the counterterms. As 
we will see in Section 9.1 we can then write renormalized Ward identities in 
the renormalized theory, which therefore exhibits the symmetry. 

In more general cases this simple procedure fails. 

One case is that the UV cut-off breaks the symmetry. For example, 
putting the theory on a lattice breaks Poincare invariance. Luckily, other 
regulators, like dimensional continuation, preserve this invariance, and the 
renormalized theory with no cut-off is Poincaré invariant. Some sym- 
metries cannot be preserved after quantization. It must be true that no 
regulator can preserve them. An example, to be treated in Chapter 13, is the 
chiral invariance of QCD. 
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Another case, which we will treat later in this chapter, is of spontaneous 
symmetry breaking, typified by the theory given by (9.0.1) with m? replaced 
by — m°. This is called the Goldstone model. In this case the ground-state — 
the vacuum - is not invariant under the symmetry, and the field acquires a 
vacuum expectation value: 


<0|¢|0> = [2|m?|/g]*/? + higher order. 


If we use an invariant regulator, like dimensional continuation, we will still 
be able to prove Ward identities. Hence, we will be able to prove that only 
symmetric counterterms are needed, so that the symmetry is preserved. 
From the Ward identities follows Goldstone’s theorem, that there is a 
massless boson for each generator of a broken symmetry. 


9.1 Unbroken symmetry 


We first consider a totally unbroken internal symmetry. The fields carry a 
matrix representation of the generators. Thus: 


619i = — ilta) j, (9.1.1) 


in the notation of Section 2.6. 

The proof that the symmetry can be preserved under quantization is 
elementary. We spell out the steps so that we can see what needs to be done 
in less trivial cases: 


(1) Regulate in a way that preserves the symmetry. Lattice and dimensional 
regularization both do this since the symmetry commutes with all 
Space-time transformations. 

(2) Include in Y all possible invariant counterterms up to the appropriate 
dimension. Thus 6,4 =0. For the model (9.0.1) we replace Y by 

L = Zdg' db — mEpo — Isl p) /4. (9.1.2) 

(3) To do perturbation theory, let the free Lagrangian be invariant: 
0, £5 =0. Then the interaction Lagrangian is also invariant. 

(4) At each order, choose the counterterms to cancel the divergences in 1 PI 
Green’s functions. Since the free propagators and the interactions are 
all invariant under the symmetry, the divergences are symmetric and 
non-invariant counterterms are not needed. 

(5) Remove the UV cut-off. The Green’s functions are symmetric: 


6,4 0|T$j,(1)---;, (Yn)|0> =0. (9.1.3) 


In the case of the model (9.0.1) the propagator for the charged field 
Carries an arrow indicating the direction of flow of charge. All vertices have 
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equal numbers of ingoing and outgoing lines. In (9.1.3) we have 6@ = — id 
and 6¢'=id', so this equation is literally a statement of charge 
conservation. 

The current for a symmetry is defined by Noether’s theorem 
(Section 2.6): 


pi 9.1.4 
i= 2 Ce < = oe (9.1.4) 

In the case of the simple model (9.0.1) there is a single current 
phe =izotord. (9.1.5) 


We derived the Ward identities of the bare theory (2.7.6). For the theory 
(9.0. these are 


oa COLTA) PONEC) AEn] 


N 
=i $ [dx — y) ôx — zI] CO TAY:) PON z) P"(Zy)[9D. 
l (9.1.6) 


We showed in Section 6.6 that the current is in fact finite; no extra 
renormalization counterterms are needed beyond those implied by the 
factor Z in (9.1.5). 

It is of interest to see how the divergences that are present get cancelled by 
the factor Z. For the two-point function of j* we have the 1PI graphs of 
Fig. 9.1.1, up to order g*. Since Z = 1 + O(g’) in this theory, we may replace 
Z by 1 everywhere except in the tree graph (a). Graph (b) could be 
logarithmically divergent by power-counting, but is in fact zero, so no 
counterterm is needed at order g. Graph (e) is also zero. Graphs (c) and (d) 
are finite after their subdivergences are cancelled by a counterterm; they 
also cancel each other. These cancellations arise since these graphs have a 


oy 
XX A X 


Fig. 9.1.1. Graphs up to order g? for the two-point function of j”. 
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subgraph which is a graph for 
<0| Tix) 6'(y)[0>. 


In momentum space this is of the form q“ f (q*). The Ward identity implies 
that its divergence is zero: 


<0 T(x) $'4(y)|0> = idx — y)<0]4(G'4)0> =O, 


so that q? f(q7) = 9. 

Graphs (f) and (g) each have a subdivergence which is cancelled by a 
graph of the form (b), which is zero. Their overall divergence must be 
cancelled by using the order g? term in Z in graph (a). 


9.2 Spontaneously broken symmetry 


To explain the renormalization of theories with spontaneously broken 
symmetry it will be sufficient to consider the case of the Goldstone model: 
L£=Z6,h'0"d +m pth — g(h')"/4 + ôm pp — 59h" $)*/4 
= (06 1)"/2 + (3)"/2 + mlhi + $3)/2 — gli + $3)*/16 

+ counterterms. (9.2.1) 

Here we have written the complex scalar field in terms of real fields: 

$ =(¢, + ig,)2~ ‘/*. The mass term is of the ‘wrong sign’. This will result in 

spontaneous breaking of the symmetry under ¢— de®. The Noether 

current for this symmetry is 

jt =iZb'O" = Z( 0". — $20"). (9.2.2) 

For small couplings the Euclidean functional integral is dominated by 
fields close to the minimum of the potential in (9.2.1). This 1s at 

|p| =2m/g’’. (9.2.3) 


The perturbation expansion amounts to a saddle point expansion about the 
minimum. It is set up by making the substitution 


pı =b; + 2m/g'”, (9.2.4) 
to give 
L =(09;)?/2 + (062)°/2 —m’*b? — gb? + $3)°/16 
— mg bi (oy + 63)/2+ La, (9.2.5a) 
where 


L a= — 69 bY + $3)?/16 — dgmg™ oilo? + 63)/2 
— pX (3m°ôg/g — dm?)/2 — 63(m?5g/g — dm?)/2 
— 2'mg~ '!?(m?5g/g — õm?) + (Z — 1)(0o'2 + 02)/2.  (9.2.5b) 
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The idea of making this perturbation expansion is that in the functional 
integral we impose a boundary condition that fixes the phase of the field at 
co. By the symmetry we may make this phase real, without loss of 
generality. In three or more space-time dimensions, fields that have a 
different phase over a large region have an action so much larger that 
quantum fluctuations cannot destroy the boundary condition. Then ¢, is 
forced to have a real vacuum expectation value close to 2m/g?’”. 

In setting up the perturbation expansion we have tadpole graphs like 
Fig. 9.2.1. These generate a vacuum expectation value for ¢; 

<0]; [0> = dv 
that starts at order g+. It means that ¢, has vacuum expectation value 
2mg~'!* + ôv. There are then graphs like Fig. 9.2.2, where the tadpoles 
appear as subgraphs. It is possible to recast the Feynman rules by writing 
pı = pi + 2mg~ 1? + ôv and requiring $7 to have zero vacuum expectation 
value. A better practical approach is to impose ĝv = 0 as a renormalization 
condition on 6m’. 


ho sig tones 


Fig. 9.2.1. Graphs for (0|@;|0>. 


2 YS 


Fig. 9.2.2. Graphs containing tadpoles as subgraphs. 


If we start with the theory (9.0.1) without spontaneous symmetry 
breaking and vary m? until it is negative, then we should pass through a 
phase transition and thereby reach the Goldstone model (9.2.1). There must 
be an actual phase transition because <0|@|0> is exactly zero in the phase 
with unbroken symmetry. Since this expectation value is non-zero in the 
Goldstone phase there must be non-analyticity of the theory as a function 
of m°’. 

Now, we must renormalize the theory: the continuation in the re- 
normalized mass m° is sensible only if the counterterms are the same 
functions of m* and g in the two phases. It is sensible to use a mass- 
independent renormalization prescription, for then the dependence on m? 
of the counterterms is the simplest possible. We will prove the following: 
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(1) Renormalization of the Goldstone phase is accomplished by using only 
symmetric counterterms in the Lagrangian (9.2.1). 

(2) The dimensionless counterterms Z — 1, ôg, and 6m?/m? can be chosen 
to be the same as in the phase of unbroken symmetry (the so-called 
Wigner phase). 

(3) The current given by (9.2.2) is finite just as it is in the Wigner phase. 
Since the bare Lagrangian is invariant under ¢—d¢e~'®, Ward 
identities are valid and from them Goldstone’s theorem follows, that the 
physical mass of @, is exactly zero. 


We must also discuss the choice of a practical renormalization prescription. 


9.2.1 Proof of invariance of counterterms 


We will do perturbation theory by choosing the free Lagrangian 


Ly = 07/2 + 63/2 -me (9.2.6) 
and the basic interaction 
L, = — glp? + $3)/16 — mg? pilo? + $3)/2. (9.2.7) 


The counterterms are given the form (9.2.5b) and ôg, 6m?/m*, and Z are 
given the same values as in the unbroken theory with a mass-independent 
renormalization scheme. We will prove that these counterterms are 
sufficient to make the broken-symmetry theory finite. 

Some of the interaction vertices are the same as in the Wigner phase. The 
others are obtained by substituting 2m/g'/* for ¢,. Therefore graphs 
involving the extra vertices are obtained by erasing external ¢, lines on 
symmetric graphs. Examples are shown in Fig. 9.2.3. The only complication 
is that mass terms generated from the basic interaction go into the free 
rather than the interaction Lagrangian. This is the sole source of 
complications in our proof. 


1 2 
(a) > +other terms 
l 2 


1 1 
(b) $ Xa L Ex other terms 
1 1 


Fig. 9.2.3. Generation of graphs in theory with spontaneously broken symmetry 
from graphs in the symmetric theory. 
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To relate the counterterms to those in the unbroken theory let us write 
the free propagators as follows: 


ee i(2m? — M?) i(2m? — M?) 
P1502 Im? pM? * (pM (p?— M?)°(p? — 2m) 
i i —iM? iM* 
O22 PM? PMP Mp? (9.2.8) 


Here M? is a arbitrary parameter. We substitute (9.2.8) for every line in a 
graph. 

Suppose we substitute the first term on the right of (9.2.8) for every line of 
a basic graph which has only four-point basic vertices. Then we obtain a 
graph in the symmetric theory with mass M. 

The difference between these symmetric graphs and the true theory is 
given by: 


(1) graphs with one or more three-point vertices, 
(2) graphs with the second or third term on the right of (9.2.8) substituted 
for one or more propagators. 


In either case the degree of divergence is reduced. Now the maximum 
degree of divergence is two. So substitution of the third term in (9.2.8) 
always makes a graph overall convergent. We are allowed at most one 
substitution of the second term. 

Let us now suppose that all graphs with fewer than N loops are 
successfully renormalized by our symmetric counterterms. We will prove 
inductively that all N-loop graphs are renormalized. The induction starts 
because tree graphs need no renormalization. We decompose the mass 
counterterm in Z q as 


— 31 [Bm ôg/g + (Zm — 1)(— m? — M?) + (Zm — 1)M°] 
— 33[m76g/g + (Z,, — 1)(— m? — M?) + (Z„ — 1)M?]. (9.2.9) 

Here Z,, =(m* + 6m7)/m? is the mass renormalization factor. 
After substitution of (9.2.8) for each propagator in a basic 1 PI graph with 
N-loop all subdivergences are cancelled by counterterms of lower order, 


according to the inductive hypothesis. We are left with the following overall 
divergences: 


(1) Logarithmically divergent graphs for the four-point function with all 
propagators set to i/(p* — M?) and with only four-point vertices. Such 
graphs have an overall divergence independent of M which is removed 
by counterterms in 6g for the symmetric theory. No other 1 PI graph for 
the four-point function has an overall divergence. 


(2) 


(3 


N 


(4) 
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Self-energy graphs with four-point vertices only and with propagators 
i/(p? — M°). Field renormalization and the (Z,, — 1)M? terms in (9.2.9) 
renormalize these, again exactly as in the symmetric theory. 
Self-energy graphs as in (2) but with one propagator replaced by a 
second term in (9.2.8). In the numerators of (9.2.8) we write 

2m? — M? = 3m? — (m? + M?) = (2m/g!/”)?(3g/4) — (m? + M7), 

— M? = m — (md? + M?) = (2m/g'!")?(g/4) — (m? + M?). (9.2.10) 

The terms with —(m*+ M°) are renormalized by the (Z, —1) 
(— m? — M?) parts of the mass counterterms. They correspond to the 
effect of differentiating the self-energy graphs with respect to M?. The 
other terms in (9.2.10) we will regard as an insertion of a four-point 
vertex on a line when two ¢, fields are replaced by 2m/g'/?. These terms 
are considered under (4). 
Graphs of classes (1) and (2) in which one or more external @, fields are 
deleted and replaced by 2m/g'/*. Examples are Fig. 9.2.3(b) and 
Fig.9.2.4. The same replacement generates the counterterm 
Lagrangian (9.2.5b) from the symmetric theory, so we have counter- 
terms for them. 


This completes the proof. 


Was OE OM 
wince ae 


Fig. 9.2.4. Generation of graphs with loops in theory with spontaneously broken 


symmetry from graphs in the symmetric theory. 


9.2.2 Renormalization of the current 


The same procedure shows that the current 


B= L2(h,0"b2 — G20") + 2Zmg™ 0h, (9.2.11) 
l 2m/g'!? 
+ +(Z — KK 
2 y 24 
2 2 loop 


Fig. 9.2.5. Renormalization of current in spontaneously broken theory. 
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has finite Green’s functions. Note that the term 2Zmg~ '/*0"¢, contains the 
counterterms that renormalize graphs like Fig. 9.2.5. 
The Ward identities are then true and involve finite quantities. A typical 
case is 
6,,< 0| TI*(x)b2(y)|0> =i<0|ôp)0> 
= —1¢0]6,()|0> 
= — i(2m/g'/? + ôv). (9.2.12) 
By multiplying by the inverse propagator for ¢, and going to momentum 
space, we find 
P T$}, 2(p) = (2m/g*!? + dv)(i/G,,(p*)), (9.2.13) 
where I’, is the set of graphs for ¢0|Tj"@,|0> that are 1PI in @,. This 
is illustrated in Fig. 9.2.6. Since T” œ p” as p? — 0, (9.2.13) implies that G;,' 
has a zero at p? =0, in other words that ¢, is massless to all orders of 


perturbation theory. This is the Goldstone theorem (Goldstone, Salam & 
Weinberg (1962)). 


p(z fy GR )-o FO?) 


Fig. 9.2.6. The Ward identity that implies Goldstone’s theorem. 


9.2.3 Infra-red divergences 


Individual graphs with a self-energy insertion on a @, line have infra-red 
divergences. Such a graph is illustrated in Fig. 9.2.7, and the divergence 
comes from the region where the momentum k on the œ, line is close to 


zero: 
1 
d*k—,.. 
| kmo (K?) 


If uncancelled, this divergence indicates that the self-energy shifts the mass 
to a value other than zero. But the Goldstone theorem tells us that the self- 
energy is zero at k=0. So the infra-red divergence cancels against 
divergences in other graphs of the same order. 


Fig. 9.2.7. Graph with infra-red divergence. 
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9.3 Renormalization methods 


One of the practical problems that arises in making calculations in a theory 
with spontaneous symmetry breaking is to find the most convenient 
renormalization prescription. Fundamentally, there is no problem, for all 
renormalization prescriptions are related by renormalization-group trans- 
formations, and are therefore equally good. But, in practice, choice of one 
prescription over another can save some labor. The problems become 
particularly acute in gauge theories of weak interactions (Beg & Sirlin 
(1982)). 

Among the issues to be considered in choosing a renormalization 
prescription are: 


(1) If we ignore higher-order corrections, then some parameters are equal 
to quantities, like particle masses, that are easily measurable. It is often 
convenient to impose exact equality as a renormalization condition. 

(2) One must treat tadpole graphs. Their effect is to provide an additional 
shift ôv in the vacuum expectation value of the field. Leaving these 
graphs as they are considerably increases the number of graphs 
contributing to a given Green’s function. Shifting the field by ĝv gives 
many extra terms in the formulae for the coefficients in (9.2.5a) and 
(9.2.5b). One can impose ôv = 0 as a renormalization condition, at the 
expense of removing the simple connection between the phases of 
broken and unbroken symmetry (as was exploited in Section 9.2). 

(3) It is necessary to relate calculations done by different people. Direct 

comparisons can be made only if the same renormalizations are used. It 

is evidently useful to agree on a standard. 

If the coupling is not very small or if there occur very large ratios of 

masses and momenta, then one must choose a renormalization 

prescription with the ability to remove the large logarithms. 


(4 


sw 


One approach is to use dimensional regularization with minimal 
subtraction. Graphs can be renormalized by the forest formula. At one-loop 
order this amounts to subtraction of the pole part from each 1PI graph. We 
can do this without regard to the symmetry relations between counterterms 
for different Green’s functions. Since the counterterms have the pure-pole 
form, these relations are automatically satisfied. 

Another approach is to compute Z, ôg, and ôm? by three renormalization 
conditions imposed on some of the 1PI Green’s functions in the broken- 
symmetry phase. Then the values of counterterms for other Green’s 
functions are computed from (9.2.5b). It is convenient to determine 
m?5g/g — 6m? by requiring ôv = 0, i.e., <0|¢,|0> = 2mg~ !/? exactly. Then 
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Fig. 9.3.1. Renormalization of self-energy at one-loop order. 


the mass counterterm for ġ, forces the 1PI self-energy of ¢, to be exactly 
zero when k? = 0. The one-loop graphs are shown in Fig. 9.3.1. 

Both of these approaches require explicit computation of the values of Z, 
ôg, and ĝm? to find the values of counterterms for the various Green’s 
functions. It is also possible (Symanzik (1970a)) to use the Ward identities 
to generate renormalization conditions for all divergent 1PI Green’s 
functions from the three basic conditions. These conditions are simple if the 
three basic conditions are imposed at zero external momentum. 


9.3.1 Generation of renormalization conditions by Ward identities 


The general Ward identity is 
ð N 

za OLT) TI ,,(¥) 10> 
j=1 


i | 
= —i [[ dx — yO Tp, (x) [I ¢,,(¥)|0>. (9.3.1) 
j=1 i#j 


In the Goldstone model, the labels n; take the values I’ or 2, and we have 
ô$; =, and 6¢, = — ¢, = —(¢; + 2m/g'’*). For simplicity we impose 
the condition <0|¢1|0> =0. Suppose we have obtained renormalization 
conditions valid up to !—1 loops. We will now find the appropriate 
conditions for l-loop graphs. 

The case N = 1 was given in (9.2.12) and (9.2.13), and in Fig. 9.2.6. We saw 
that one renormalization condition on the self-energy of ġ, is that it is zero 
at p? =0. Another condition on the derivative can be chosen arbitrarily, 
corresponding to the freedom to multiply Z by a finite factor. Suppose we 
choose to make the residue of the Goldstone pole equal to unity, and we 
choose to make ôv = 0. Then we also obtain the renormalization condition 
on the Green’s function <0|Tj*(x)¢,(y)|0> of the current with ¢,. The 
condition is that it is equal to its lowest-order value at p = 0. This condition 
is equivalent to making the counterterm equal to 


2mg~ *!7(Z — 1)0",, 
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as required if j“ is to be the. Noether current. 
Similarly we may treat the case N =2: 


6, 9| Ti*(x)64()G 2(2)|0 > 
= —id(x — y)<0|T¢,(y)¢,(z)|0> 
+ id(x — z)<0|T $1 (y)64(2)|9), (9.3.2) 
where we used <0|¢ġ;|0> = 0. In terms of 1PI graphs in momentum space 
this gives Fig. 9.3.2. After use of Fig. 9.2.6 we find Fig. 9.3.3. 


Fig. 9.3.2. Ward identity for two-point function of j”. 


0) GE =i — 
2 i. q gi? 2 2 q a e 
-i(-@-) +i(-—O> 
1 IP 2 24 
Fig. 9.3.3. Result of multiplying Fig. 9.3.2 by two inverse propagators. 


We now set p +q =0 to eliminate the left-hand side. This gives 


2m. 
0 = iT 212(0,p, — p) ~ [p° — 2m? — E,(0)] + [p? - E4007) 


2m . 
= gaapt, p, — p) + 2m* — 2 (p°) — X,(p”). (9.3.3) 


Here 2, and 2, are self-energies and I`}, is the 1PI Green’s function for 
two @, fields and one į. We choose a mass renormalization condition for 
2, say X,(0) = 0. Since we already know that Z,(0) = 0, this tells us that the 
renormalization condition on T,,, is 


I, ,2(0, 0,0) = — img~'/* = lowest-order value. (9.3.4) 
Since graphs for I,,, are at worst logarithmically divergent we know 
I,,2(p,q,r) completely at this order. From (9.3.3) we can now determine 


x (p°). But the calculation of £ (p°) from its graphs is already fixed except 
for a renormalization condition that determines the value of the field- 
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strength counterterm. So (9.3.3) gives us the renormalization in such a way 
that the counterterm is — (Z — 1)p’. 

Similar arguments may be applied to give renormalization conditions for 
all the remaining 1 PI Green’s functions that have overall divergences. They 
are easiest to express in terms of Ward identities for 1 PI Green’s functions. 
(See Lee (1976), and references therein.) 

The structure of these arguments generalizes what would be done in the 
unbroken phase. For example, (9.3.2) integrated over x would give £, = £, 
in this phase. This condition would say that the counterterms for 2, and 2, 
are equal. But in the Goldstone phase this is not so. Integrating over x is 
equivalent to setting the momentum at the vertex for the current to zero. 
The derivative with respect to x gives a factor of this momentum, but since 
there is a pole 1/p? the right-hand side of (9.3.2) is not zero. The argument 
that we had to use is more complicated. 


10 


Operator-product expansion 


In this chapter we will investigate two closely related problems. We work 
with ¢* theory in d = 4 dimensions and consider a time-ordered product of 
two fields, T @(x)@(0), together with its Fourier transform 


T $(q)¢(0) = | d*xe't*T (x) (0). 


(It is easiest to work with time-ordered products. The methods work with 
any pair of operators T A(x) B(O) in any theory.) 

The first problem is to ask how T ¢(x)@(0) behaves as x” =O. If the theory 
were totally finite then the result would just be #7(0). However, there are 
ultra-violet divergences that prevent the product from existing, so the limit 
does not exist. It was the idea of Wilson (1969) that @(x)@(0) should behave 
like a singular function of x times the renormalized [ $7 ] operator, as x > 0. 
The full result is that we have an expansion of the form 


To(x)P(9) ~ 2 Co(x")[ O(0) ] (10.0.1) 


as x +0. Here the sum is over a set of local renormalized composite fields 
[0] and the C,(x)’s are c-number functions. This formula, or one of its 
generalizations, is called an operator product expansion (OPE), and the 
coefficients C, are often called Wilson coefficients. Corrections to (10.0.1) 
are smaller by a power of x” than the terms given. 

The second problem we wish to treat is the behavior of T(q)@(0) as 
|q?|—> œ. More precisely we will consider the momentum-space Green’s 
function 


Gy 42 = <0|TG(Q)GO)b(P;).-- d(Pw)|9, (10.0.2) 


when q“ > œ along a fixed direction with p,,..., py fixed. In other words we 


scale the invariants q* > x*q’, p,-q— Kp;'q. There is an operator product 
expansion 


Gya2~ ¥ ColQ)<0| TOOP)... d(Py)|D. (10.0.3) 
0 
The relation between the coordinate-space and momentum-space expan- 
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sions 1s elementary. Let us take the Fourier transform from x to q of a 
momentum-space Green’s function of T(x)@(O). Then the large-q be- 
havior is dominated by the singularities in x-space. The only relevant 
singularity is at x = 0. So C,(q) is the large-q part of the Fourier transform of 
C,(x). Conversely if one Fourier transforms G, ,, to get 


x x d4 me 
(0|T4(x)$0)$(P,)--. $(P»)|9> = | aae *Gnse (10.04) 


then the limit x > 0 fails to exist if Gy , , falls only as 1/q* or slower as q > œ. 

Thus knowing the large-q behavior is equivalent to knowing the singular 
small-x behavior, but the coordinate-space expansion also includes 
information on the leading non-singular part of the small-x region. 

These expansions have a number of uses, particularly in an asymptoti- 
cally free theory. There the perturbation theory when improved by the 
renormalization group gives an effective method of computing the Wilson 
coefficients. Among the uses are the following: 


(1) The expansion (10.0.1) in coordinate space provides a definition of 
renormalized composite operators that does not involve any reg- 
ularization (Brandt (1967)). 

(2) Although there is no physically important process which directly uses 
the limit taken in the momentum-space expansion (10.0.3), it 1s used 
indirectly for deep-inelastic scattering of a lepton on a hadron. This 
involves a matrix element of the form 


<P|i(Q)iO)|P>. 

Here q? > — œ, but with the ratio q’/q-p fixed instead of q?/q: p° fixed. 
A dispersion relation relates this case to the limit used in (10.0.3), so the 
OPE is used indirectly, as we will see in Chapter 14. 

The form and the method of proof of short-distance operator-product 
expansion can be generalized to handle many interesting high-energy 
scattering processes. (See Buras (1981) and Mueller (1981) for a review.) 
The results in the present chapter form a prototype for these other 
results. 
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10.1 Examples 


10.1.1 Cases with no divergences 


We will mainly restrict our attention to Green’s functions of $(x)@(0) in 
which both ¢(x) and (0) are connected to other external lines. This is the 
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x 0 Q x 
Pi ra \ ei va 
(a) (b) 
Fig. 10.1.1. Lowest-order graphs for operator-product expansion of $(x)@(0). 


case that is relevant to most applications. Our theory will be ¢* theory 
L = 07/2 — md? o?/2 — go*/24 + counterterms. (10.1.1) 
First consider the tree graphs Fig. 10.1.1(a) for 


<0|T (x) (0) }(P,)b(p2)|0>. (10.1.2) 
These give 
i i ; ; 
nre a er (10.1.3) 
Expansion in a power series about x = 0 gives 
i i j 
[2 -ilp +p) x — (prx? + pr x7)/2 +), 
Py — m p~ m (10.1.4) 


This is equivalent to the replacement 
T b(x)b(0) = p0) + 4x8 p? + ixx piep (10.1.5) 
as illustrated in Fig. 10.1.1 (b). This equation has the form of the operator- 
product expansion (10.0.1). 
Thus the operator-product expansion in this case (free-field theory) is 


really a Taylor expansion of ¢(x) about x = 0. The power of x in each term is 
just such that no dimensional coefficients are needed: 


C,(x) = constant x |x|*, with a = dim (0) — dim[¢(x)@(0)]. (10.1.6) 
This result also correctly gives the power-law behavior in the presence of 
renormalizable interactions, as we will see. But there will also be 
logarithmic corrections. 

A feature which does not appear to survive inclusion of interactions is 
that the series on the right of (10.1.4) is convergent and sums to give 
T(x) (0). 

Consider next the graphs of Fig. 10.1.2 for the four-point function of 
T $(x)(0). The important factor comes from the lines carrying momentum 
q: 

ig? dq _texp(—ig'x) +expli@—Pa—Ps)x]} 
(2n)* (q* — m’)[(p4 — 4)’ — m*] [Q -pa - Dp) — m°] 


(10.1.7) 
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H D4 Pg 


Fig. 10.1.2. Higher-order graphs for operator-product expansion of ¢(x)¢(0) 


We now expand the integrand in powers of x to obtain 
ig? | aq Pat PX 
(27)* (q? = m’)[(p, a q)? _ m°] [(q p= Dp)? 5 m?] +e 
(10.1.8) 


(10.1.7) ~ 


These first two terms are just those we would expect from (10.1.5). But the 
higher terms have at least two extra powers of q in the numerator and are 
therefore ultra-violet divergent. The divergences are those of the Green’s 
function of the composite operators. They indicate that modification of the 
higher terms of the expansion is needed. For example, the behavior of the 
coefficient of @0,0,@ is modified by a logarithm of x. 

Similar modifications will be needed for the coefficient of @*, when we 
consider higher-order corrections. Therefore we will find it convenient just 
to restrict our attention to the leading-power behavior, corresponding to 
the ° term in (10.1.5). 


10.1.2 Divergent example 


Aside from trivial propagator corrections the contribution of order g to the 
two-point function of T¢(x)@(0) is given by Fig. 10.1.3(a), which gives 
e7 iq’ x 
(Pi — T: (p? — m*)(p3 — m) font? —m?)[(q— P1 -P m] 
When x > 0 the integral diverges logarithmically. This is a symptom of the 
fact that there are two important regions of q that contribute. The first is 


(10.1.9) 


0 


x 
q =|44 ]-uv|+ os er 


(a) 


Fig. 10.1.3. Graph for operator-product expansion of #(x)@(0) with divergences for 
the operator. 
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where q is finite as x +0; the contribution is correctly given by replacing 
T $(x)(0) by $7(0) in the graph. The second region is where q becomes 
large, up to O(1/x) as x0; in this region the interaction vertex in 
coordinate space is close to x and 0. 

In the second region the loop is confined to a small region in coordinate 
space. From the point of view of p, and p, the loop is a point. So we should 
be able to represent the contribution of this region by an extra term in the 
Wilson coefficient of ¢: 


Th(x)$(0) ~ Cy2(x)[ 67], l 
Cy. = 1 + (G/1627)c,(x’). 
Let us now calculate c,(x°). 
Now the contribution of the first region is given by replacing To(x)¢(0) 


by 7(0). However, this operator has an ultra-violet divergence. So let us 
add and subtract the renormalized Green’s function of [ 67], i.e., 


<0|TL$7(0)]d(P1)$(P2)|9>, (10.1.11) 
to give the equation depicted by Fig. 10.1.3. The contribution of order 1 
from the first region is entirely contained in the term (b) representing 
(10.1.11): 


2 1 a. 2 ia 
e a, aa | ey 
(pi — m*)(p2 — m*) 16x* | J o anu 
(10.1.12) 


Here we have used minimal subtraction. The remainder, term (c), 1s 


(10.1.10) 


S  _=-9y 
(p? — m’)(p5 — m*)(2n)* 
? AG oe! UVdi nce} (10.1.13) 
a >ar, oo a os — 1verge . a 
Tg- m°)[(q — Py — p27 — m°] p 


When x —0 the contribution from finite q is of order |x|. But there is a 
contribution of order 1 from large q: this is the contribution to the original 
graph minus whatever is taken care of by graph (b). 

We can identify c,(x?) in (10.1.10) as the x +0 behavior of the curly 
bracket factor of (10.1.13) (aside from a normalization factor), since to 
lowest order 


—2 
—m?*)(p; — m°) 
Now the leading-power behavior of the curly-bracket factor is independent 
of p,,p, and m. This is easily seen by differentiating with respect to any of 


<0| T $7 $(P1)6(P2)|9> = (p? 
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these variables. The result is a convergent integral which goes to zero likea 
power of x when x +0. So we may define c,(x) by setting m = p, = p, =Q: 


ae | 2 
C,(x) = ater as afaa oy tat (10.1.14) 


The integral is easily done by using 


Mey = | dzze 7-7) 


(0) 
to turn it into a Gaussian form, with the result 


(x) = 4[y + In(— n? u?’ x?)]. (10.1.15) 


10.1.3 Momentum space 


We now Fourier transform Fig. 10.1.3 to obtain the O(g) contribution to 
<0| TSOA )AlP)]0>. 
As q? > œ we find 
—1 
(pj — m’)(p3 — a 
This gives a contribution to the term in the operator-product expansion 
(10.0.3) with © = ¢*. The coefficient is 


Fig. 10.1.3(a) ~ ay + O[1/(q2)3]. (10.1.16) 


Cy2(q°) = xa + O(g°), (10.1.17) 


ia 


which is just the Fourier transform of the order g term in the coordinate- 
space expansion, gc,(x)/(162*). The g? term in the coordinate-space 
coefficient is independent of x, so that it gives a 6‘*(q) in momentum space, 
and hence nothing at large q°. 


10.1.4 Fig. 10.1.3 inside bigger graph 


The expansion (10.0.1) or (10.0.3) indicates that the same asymptotic 
behavior as x > 0 (or as q> ©) is obtained independently of the Green’s 
function considered. This happens because graphs like Fig. 10.1.3 can occur 


q [A - uv} 4 c,(x) 


Fig. 10.1.4. Even higher-order graph for operator-product expansion of $(x)¢(0). 
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as subgraphs of graphs with more external lines. An example is given in 
Fig. 10.1.4. 


10.2 Strategy of proof 


First we will make precise the limits in which the operator-product 
expansion applies. If we are in Euclidean space (i.e., with imaginary time 
and energy) then there is really only one way in which we can take x” to zero 
or q” to infinity. However, in Minkowski space we can let x? > 0 without 
each component going to zero, and we can let components of q“ go to 
infinity without q?—> œ. These cases are interesting physically. For 
example, the q” > œ limit with finite q? is the case of high-energy scattering. 
Much is known about these limits, but they are beyond the scope of this 
book. 

We will prove the coordinate-space expansion (10.0.1) in the case that all 
components of x“ go to zero with their ratios fixed. The corresponding 
momentum-space expansion (10.0.3) we will prove in the limit that all 
components of q“ go to infinity with a fixed ratio, and with q” not light-like, 
so that q? > œ. These limits are essentially Euclidean. 

Our proof will be in perturbation theory. The first step is to identify the 
regions of loop-momentum space that give the leading-power behavior in 
the x +0 or q —> œ limits. Then we generalize the arguments of the previous 
section, which applied to specific graphs. 

The region of large q which we investigate in the momentum-space 
expansion (10.0.3) is precisely the one to which Weinberg’s (1960) theorem 
applies. The theorem tells us to consider all subgraphs connected to the d(q) 
and (0) in (10.0.2). For each such subgraph we let all its loop momentum be 
of order g, and count powers just as we did for UV divergences. The 
subgraph(s) with the largest power of q? correspond to the dominant 
regions of momentum space. Then as q?— œ, the complete graph is 
proportional to this power of q? times possible logarithms of q’. 
Corrections are smaller by a power of q°. Although Weinberg’s result also 
tells us the highest power of In (q°) that appears, it is easier to determine this 
by first constructing the operator-product expansion and then applying 
renormalization-group methods (as in Section 10.5 below) to the 
coefficients. 

Essentially the same method can be applied to obtain the short-distance 
behavior, (10.0.1). For example, in Fig. 10.1.4 we have leading contri- 
butions with g finite or with q large (of order 1/x), but always with the lower 
loop momentum, k, finite. The leading power is (x”)°. The logarithm of x in 
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(10.1.15) for the corresponding Wilson coefficient comes from integrating 
over momenta intermediate between these two regions. 

Suppose we have large momentum confined to a subgraph T. Then the 
power of q (for the momentum-space expansion) is exactly the dimension of 
I’, since our theory is renormalizable, with dimensionless couplings. The 
leading power of g? comes from subgraphs with the largest dimension, i.e., 
subgraphs with the smallest possible number of external lines. This number 
is two (beyond ¢(q) and ¢(0)), so that the leading power is 1/(q?)?. The 
subgraphs have the form of the subgraph U in Fig. 10.2.1. In the ultra-violet 
subgraph U, all lines carry momentum of order q. This subgraph is 1 PI in its 
lower two lines. All momenta in the infra-red subgraph / are finite. 


k l 
CT 


Fig. 10.2.1. General structure of lead- Fig. 10.2.2. Factorization of Fig. 10.2.1. 
ing regions of momentum space for N- 
point function of ¢(x)¢ (0). 


Now, to the leading power of q?, the ultra-violet graph U is independent 
of the external momenta k and l flowing into it. Thus we may replace U by 
its value when k =! =0 (and we may set the mass m = 0). We may also 
replace the infra-red subgraph I by an insertion of a vertex for 7/2 in an N- 
point Green’s function. This is illustrated in Fig. 10.2.2. This has the same 
structure as the operator-product expansion. But it should be emphasized 
that we are supposing that loop momenta are restricted to certain regions. 
These regions are not defined very precisely, and it is one of the tasks of the 
proof to remedy the impreciseness. 

Schematically we have 


Y Gy 4 2(GsP1s+-+>Pw) 


graphs 
~ Y U(q,k = 0,1=0))) | d*kd*ld(k +1—)'p)I(k, 1 Py,--->Pw)- 
U I 
small (10.2.1) 


To construct the expansion we generalize the technique that we applied to 
Fig. 10.1.3. We consider each graph U that could appear in Fig. 10.2.1, but 
we do not restrict its momenta. It can occur as a subgraph of some graph for 
the complete Green’s function. If all momenta in U are of order q and if all 
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iy large momenta 


small momenta 


Fig. 10.2.3. A possible leading region for a subgraph of the form of U in Fig. 10.2.1. 


momenta outside are small, then we get a leading contribution to the cross- 
section. We can also have a leading contribution where the large momenta 
occur inside a proper subgraph of U -as in Fig. 10.2.3. Suppose we 
subtract off all of these contributions. Then we integrate over all loop 
momenta of U and find that the result only gives a leading contribution 
when all its momenta are large. We therefore define the contribution of U to 
the Wilson coefficient as 


C(U) = U — subtractions for regions of form Fig. 10.2.3 (10.2.2) 


all evaluated at k =1=m=0. 

The resulting formula for C(U) is very similar to that of the formula for 
renormalizing the ultra-violet divergences of a graph. In fact, as 
Zimmermann (1970, 1973b) explains, a good way to prove the operator- 
product expansion is to treat it exactly as a problem in renormalization. His 
method, used in the next section, is not to compute directly the Wilson 
coefficients but to define first a quantity which is a Green’s function minus 
the leading terms in its operator-product expansion: 


Ges _ 2, CoCo,w- 


This is constructed as a sum over graphs T for G,,,. Each graph has 
subtracted from it not only counterterms to remove ultra-violet divergences 
but also counterterms to cancel the large-Q (or small-x) behavior. The result 
we call Rẹ(T). 

Now, the subtractions that remove the large-Q behavior are a sort of 
oversubtraction. So R,(I) is simply related to R(T) in the style of a 
renormalization-group transformation. This transformation can then be 
written as the Wilson expansion, as we will see. 

A disadvantage of Zimmermann’s proof is that it uses BPHZ re- 
normalization. He takes advantage of certain short-cuts available through 
the use of zero-momentum subtractions. We will choose not to take these 
shortcuts so that minimal subtraction can be applied to ultra-violet 
divergences. Our method of proof will be essentially the same as the one 
used for problems with large masses (Chapter 8). 

The same techniques apply to the coordinate-space expansion. Here, the 
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momentum q is integrated over. When we do power-counting, large 
momenta are regarded as of order 1/x and small momenta as finite when 
x—> 0. The leading behavior again comes from graphs of the form of U in 
Fig. 10.2.1, and the power is (x*)°. There is a difference in the form of the 
possible graphs U that carry large momentum. All the graphs that we use in 
the momentum-space case are also used in coordinate space. But we can 
also have the graph consisting of the vertices for ¢(x) and (0) and of 
nothing else. 


10.3 Proof 


We must now prove the operator product expansion. In ¢* theory we 
consider the part of the Green’s function, 


Gy 4 2(X, Pi»: -> Pw) = <O TH(x)h0)O(P,)-P(Py)|O>, (10.3.1) 


in which each of ¢(x) and @(0) is connected to some of the other external 
lines. We will now scale x by a factor x and construct a decomposition of the 
form: 


Gye (KX, P43. -> Pyn) 
= C(k?x’)<0 Tilo Oep) -$(py)|9> + ry 42(KX, P15.. -s Py). 
| (10.3.2) 
In every order of perturbation theory, the coefficient C(x*x*) behaves like 
(x*)° times logarithms, when the scaling parameter x goes to zero, while the 
remainder goes to zero like a power of x. 
Fourier transformation on x gives the result 


Gy + 2(4/K, P1,---> Py) 
= C(q?/k?)<0| T5[7]6(p,): ; -$(Py)|O> + Fy 4 2(G/K, P15- - -> Py). 
(10.3.3) 
When q/k>œ, Č(q?/k?) behaves like «*/q* times logarithms, while 
Fy + 2(Q/K, P;,---) is smaller by a power. 

Our proof is given for a specific Green’s function in a specific theory. 
However, it can easily be generalized. Features specific to a gauge theory 
will be pointed out in Chapter 12. The particular case of QCD with the 
application to deep-inelastic scattering will be treated in Chapter 14. 


10.3.1 Construction of remainder 


We consider the set of graphs for Gy , 2. For each graph T, we will construct 
its contribution r (T) to the remainder Fy, ,. Each graph I we consider to be 


10.3 Proof 267 


an unrenormalized (but regulated) graph containing only basic interaction 
vertices. These are derived from 

L asic = CG? /2 — m*?/2 — p*“gh*/24, 
where g and m are the renormalized coupling and mass. 

As usual the renormalized value of the graph is R(T), which is I, plus a 
series of counterterm graphs to cancel its ultra-violet divergences. For our 
proof we will use a renormalization prescription in which the theory is finite 
when the renormalized mass m is set to zero. The renormalized value R(T) is 
then the contribution of T to the Green’s function Gy , 5. 

The remainder term r(I) is equal to F plus a somewhat different series of 
counterterm graphs. These counterterms will be constructed so that they 
cancel not only the ultra-violet divergences but also the leading x0 or 
q> æ behavior of I. 

Now r(I) isin effect a oversubtracted form of I. The oversubtractions are 
of the form of an insertion of the operator [°] times a coefficient. Thus 
R(T) — r(T `) is the Wilson expansion, i.e., the first term on the right of (10.3.2) 
or (10.3.3). A. 

The coefficient C(x?) (or C(q7)) depends on the coupling g and on the 
renormalization mass u. It must be independent of all the momenta. In 
order to be able to neglect m in the ultra-violet limit x > 0 or q > 0&0, we must 
use a renormalization prescription in which the counterterms do not 
become infinite when m — 0 (with fixed regulator). For concreteness we will 
use minimal subtraction in what follows. 

In order to define r(T-), let us recall the definition of the ordinary renor- 
malization R(T). This starts from the fact that the divergences of IF come 
from regions of loop momenta where all lines in some set of 1 PI subgraphs 
carry a momentum that approaches infinity. We label each region by the 
subgraph consisting of all the lines carrying large momentum. Then 


R(T) = -5 C). (10.3.4) 


The sum is over all subgraphs y of F, and C,(T) is essentially I with the 
subgraph y replaced by its large-momentum divergence. We define 
C (T) to be non-zero only if y is a disjoint union of one or more 1PI graphs 
Y1o--->Y,- In that case each y; is replaced by a counterterm vertex C(y,), 
which is the divergent part of y. To avoid double-counting, the sub- 
divergences are subtracted off first: 


ctx) = 9| v,- y cad | (10.3.5) 


O¢y 


268 Operator-product expansion 


Here ‘P’ denotes ‘pole part at d=4 and the sum is over all proper 
subgraphs 6 of y, As usual, if we use some other renormalization 
prescription than minimal subtraction the operator F is replaced by the 
Operator appropriate to the renormalization prescription. 

The definition of the remainder r(T) is almost identical to the definition of 
R(T). Now, the leading short-distance (i.e., x > 0) behavior of I comes from 
the following regions: 


(1) where the momentum gq is finite, 
(2) where q gets large and the momenta in a graph of the form of U in 
Fig. 10.2.1 also get large. 


Further leading contributions come from regions where in addition 
momenta get infinite in some set of divergent 1PI graphs. These extra 
contributions correspond to the ultra-violet divergences. In the 
momentum-space expansion (10.3.3) the same regions are leading except for 
the region of finite q. 

We define r(I) to be I with all ultra-violet divergences subtracted and 
then with all the leading small-x behavior subtracted: 


MT) = RIT) — ¥ $ L, (D). (10.3.6) 
ô 7 


Here the sum over ô is over all graphs of the form of U in Fig. 10.2.1 and the 
sum over y is over all subgraphs y of T that do not intersect 6. We use L, to 
symbolize a subtraction operation defined below. It is used to extract the 
contribution that comes from the region where the momenta in graph ô are 
of order 1/x and the momenta in y go to infinity. The case of finite q in 
Fig. 10.2.1 is covered by the case that 6 consists of the vertices for (x) and 
(0) only. 

We define the subtraction L, _ (T) to be zero unless y is a disjoint union of 
IPI y,,...,y,- In that case each y, is replaced by its overall counterterm C(y;) 
defined in (10.3.5) while the graph 6 is replaced by a quantity L(ô). L(6) is to 
contain the leading behavior of 6 when all internal lines have large 
momenta. This is the same idea as that C(y,) is the overall divergence of y;. 
Now there are regions where a subgraph 0’ of ô carries momenta of order q 
and other lines in 6 carry small momenta. To avoid double-counting we 
subtract them first. So we write: 


L(ô) = r| sy. ¥ Lust | (10.3.7) 
JE ô yNd=D 


Here T is to be an operator that picks out the leading x — 0 behavior of its 
argument. Now this behavior is independent of m, and of the finite external 
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momenta. (We prove this by differentiating with respect to the variables m 
and p’.) So we define T to set the values of m and the finite external 
momenta to zero. 

We now have a complete definition of r(I). 


10.3.2 Absence of infra-red and ultra-violet divergences in r(T). 


In Fig. 10.1.1(a) the only possible graph of the form of U is the one 
that consists of ¢(x)¢(0) alone. We call it 5,. Its value is e~'?!** + 
e P? x, It has no subgraphs so that 
L(6,) = (e721 * + e7 'P?"*) 
The remainder is therefore 
r(Fig. 10.1.1) = Fig. 10.1.1 — Fig. 10.1.1(5). (10.3.9) 


Here we regarded L(5,) as a [7] vertex. It is manifest that this remainder is 
exactly the graph minus its Wilson expansion. 

Now we turn to Fig. 10.1.3(a). It has 6, as a subgraph of form U and also 
the loop, which we call 6,. Then by (10.3.7) 


L(ô2) = {5, a L;,(52) }m=p: = p2=0 


= 2, (10.3.8) 


Pi “P270 


_ oo le eax] } 
7 Qn ) 4(q?—m)[(p, +p, — 9 m] eee 
igu’ (e~'4* — 1) 
= Oat | dé rene (10.3.10) 
Hence 
rT) = RO) — Ls, 0) — L(I) 
i? igt’ fer roe Mod 
~ (p? — m?)(p3 — m?) (2r) 4(q? — m)[(p, + P2 — 9)? — m?] 


- [a rer Serres dê ag 
4(q? — m?)[(p, + P2 — 9)? — m?] o i 
(10.3.11) 


where we use I to denote Fig. 10.1.3(a). 
The following properties hold: 


(1) L;,(I) is infra-red convergent even though it has zero mass and zero 
external momentum. Although 6, has an infra-red divergence when m, 
Pı, and p, approach zero, the subtraction term L, (6,) exactly cancels 
the divergence. 
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(2) Lə (T) is ultra-violet divergent, since in replacing the vertex 
6, =e '* by 1 we remove the ultra-violet cut off. However, L; (T) 
contains all the large-q behavior of T and of subtractions for subgraphs 
of 6,. Thus L, (T) cancels the ultra-violet divergence of L, (T). 

(3) When m, p,, p, approach zero, we find that 


r(ô2) = R(d,) — Lz, (62) — Ld) 
= 0, — Ls (02) — L(62) 
>Q. 


This is just the statement that L(ô,) is the value of ô, at m =p, = 
p, =0, after subtractions on subgraphs are made. 


The explanations of these properties are convoluted, but with the aim of 
demonstrating that they are true in general. Refer now to the general 
definition of r(T), viz., (10.3.6), and refer to Fig. 10.2.1 instead of Fig. 10.1.3. 
Then the above properties get replaced by: 


(1) L(ô) is infra-red convergent for any graph of form U in Fig. 10.2.1: the 
only regions that could give infra-red problems are cancelled by 
subtractions. 

(2) r(I)is ultra-violet and infra-red convergent if m, p? and p2 are non-zero. 
The individual terms L, (T) are IR finite. The subtractions remove all 
ultra-violet behavior. 

(3) r(6)=0 when m= p, = p, = 0. 


10.3.3 R(T) — r(T) is the Wilson expansion 
Although 


WT)=RT)-rt)=> L,( i cr) (10.3.12) 
á yo 35 O 
contains the leading x > 0 behavior of R(T), it is not yet in the form of the 
operator-product expansion, which is 


WT) = ¥ SRT /ô). (10.3.13) 
ò 


Here C(6) is the contribution of a subgraph ô (of form U) to the Wilson 
coefficient, while T'/ô is I with 6 contracted to a point, i.e., replaced by a ġ? 
vertex. R(I/ô) will now include pole-part subtractions for the divergent 
Green’s functions of °. 

Summing (10.3.13) over T can be done by independently summing over 6 
and I/6. This gives the operator-product expansion (first term of the right- 
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hand side of (10.0.1)) with 
Cy2(x7) = ¥ C(6). (10.3.14) 
ò 


(We have used the same symbol for the complete Wilson coefficient C(x) 
as for the contribution from a particular graph.) Hence to prove the 
expansion (10.0.1) or (10.0.3), we have to prove that W(I) as defined by 
(10.3.12) equals the right-hand side of (10.3.13). 

Any graph T for the Green’s function G, , can be written in the form of 
Fig. 10.2.1. In general there will be several possibilities for the upper 
subgraph U. For the following argument we will choose U to be the biggest 
possible graph. Then, in (10.3.12) for W(I), all the subgraphs 6 are 
contained in U. So we have 

d*kd*l 
W(T) = Ome W(U)R(I) 
= W(U)® R(J), (10.3.15) 


since no divergent 1PI subgraphs include parts of both U and I. Suppose we 
prove (10.3.13) when T is replaced by U. Then from (10.3.15) it follows that 


WT) = >. CRU /ô) 8 R(I) 
ô 
= Ý C(ô)R(T/ô). (10.3.16) 
ð 


The last line follows because (a) the graphs for I/é are of the form U/ô times 
I, and (b) ultra-violet divergent 1PI subgraphs are entirely within U/6 or 
within I. Notice that the extra divergences for T/ô as compared with T are 
due to the presence of the ¢* vertex. These divergences are confined to 
graphs of the form of U/o. 

Rather than prove (10.3.13) for the case [ = U, we use it as a definition of 
C(U), recursive in terms of C(6) for smaller ô: 


2C(U) = W(U)— Y C(5)R(U/6). (10.3.17) 
ôg U 


The factor 2 multiplying C(U) is the lowest-order 1PI Green’s function of 
the operator $7. It is evidently much too good to be true that we have 
reduced what ought to be a deep proof to a mere definition. The importart 
physics was at the previous step, (10.3.16). By implicit use of power- 
counting arguments, to restrict ultra-violet divergences to within U/ô, we 
proved (10.3.13) given its truth for the case I = U. The Wilson coefficient 
C2(x) as it occurs in the expansion (10.0.1) is then the same no matter what 
Green’s function we take of T¢(x)@(0). This universality is an important 
feature of the Wilson expansion. 
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The other important feature of the expansion is that the Wilson 
coefficient is a purely ultra-violet object. This will enable renormalization- 
group techniques to be useful in its calculation. For this purpose we must 
now prove that C(U) is independent of m, k and l. 

The proof is done by differentiating the right-hand side of (10.3.17) with 
respect to the mass m or with respect to one of the external momenta k or I. 
Let A represent this operation. It is applied in turn to each propagator in 
W(U) and in R(U/6). Inductively, we assume C(6) satisfies AC(5) = 0. This is 
true for the lowest-order graph, 6,, for which C(5,) = 1 follows from (10.3.8). 
In the general case AC(U) is therefore given by 

2AC(U)=AW(U)— ¥ C(5)AR(U/6), (10.3.18) 

ÖU 

as illustrated in Fig. 10.3.1. We have a series of terms in each of which one 
particular propagator of U is differentiated. The differentiation improves 
the ultra-violet behavior, so the differentiated propagator cannot be a part 
of a leading large-momentum region connected to the vertices ¢(x)@(0). We 
therefore factor out a maximal two-particle graph 6’, just as we factored out 
U in (10.3.15). 


C(d) 


2AC(U) = -È 
D Gu) 


Fig. 10.3.1. Differentiation of Wilson coefficient with respect to a mass or an external 
momentum. 


Now we can use the result 
W(ô') = 3 C(6”)R(6'/5”) 
ð” 


to give zero for AC(U). (This argument is analogous to (10.3.16).) The 
operator-product expansion is now proved. 


10.3.4 Formula for C(U) 


From the above results we may deduce 


c(u)= L| RW) 5 CORU | (10.3.19) 


ô$ U 


Heuristically this says that C(U) is that part of U arising when all its internal 
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lines have momenta of order 1/x, less contributions taken care of by terms 
C(65) in the operator-product expansion with smaller graphs ô contained in 
U. The term in square brackets is the unrenormalized U minus (a) all 
contributions involving momenta not all of order q, and (b) contributions 
with all momenta of order q taken care of by the subtractions under (a). This 
equation reproduces exactly the calculation performed in Section 10.1 of 
the O(g) Wilson coefficient, viz., 


C(5,) =[6,—- C(5,)R(52/5;) Im =p, =p2=0° 


10.4 General case 


The most fundamental form of the operator-product expansion is (10.0.1), 
which is proved by obtaining its matrix elements from the Green’s functions 
(10.0.2). We proved the Green’s function expansion restricted to connected 
graphs in ¢* theory, and restricted to the leading power of x*. The only 
operator that then contributes is © = °. Our proof generalizes. We may 
take the full Green’s functions in any theory and include non-leading 
powers of x. 

In the general case the operator L extracts the appropriate number of 
terms in a Taylor expansion in the mass m? and in the external momenta. 
The result is that each Wilson coefficient C,(x) behaves in each order of 
perturbation theory like a power of x? times logarithms of (x7?) times a 
polynomial in x“ and m? with dimensionless coefficients. If © is a tensor 
operator then the coefficient is also a tensor, as illustrated by the lowest- 
order example (10.1.5). 

The leading operator for @* theory is in fact the unit operator since it has 
lowest dimension. We have 


THO) = C,(x?)1 + Cya(x2)[6?] + O(x*-logs). (10.4.1) 


Operators linear in ¢ are prohibited by the ¢ > — @ symmetry. 
To compute the coefficient, C ,(x*), of the unit operator to lowest order, 
we use the graph of Fig. 10.4.1. We will extract terms up to O(x°), so that we 


x 0 


L 


Fig. 10.4.1. Lowest-order term for ¢0|T ¢(x)(0)|0>. 
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find: 
C,(x’*) = C,(x7)<0| l |0> 
= {Fig. 10.4.1 — Cy2(x?)<0|[¢7(0)]|0>} 


expanded in powers 
E to order m2 


_ § (d%q i’ * a d4q i 
| JOrn) q? — m? + ie (2n)4 q? — m + ie 
us 4m? 
p 8n (d — BN oe 
dêg 1 2 utim? 
= ix] 
lox mae ae ta a | + 3nd —4) 
1 


Sar Ts L pee ere atd = 4. (10.4.2) 


The Wilson coefficient is obtained by expanding in powers of mass and 
external momentum up to an appropriate degree, which is two here. We 
have no external momenta, so the expansion is in powers of m7”. 

In the next section, when we apply the renormalization group to compute 
the Wilson coefficients we will find that each coefficient is given as a series in 
the effective coupling with the renormalization mass p set to ( — x?)!/?. The 
main application of these methods will be to asymptotically free theories. If 
we truncate the perturbation expansion, then the error will be of the order 
of the first omitted term, and hence the fractional error is of order 

1/[In(— x?)]?*?, 
where p is the number of loops in the highest-order graph. This error 
dominates any positive power of x. 

Consequently, it is difficult to use the power-law corrections to the 
leading power in a Wilson coefficient. This would suggest we cannot 
properly use anything but the coefficient of the unit operator. However, in 
applications we will normally work with connected Green’s functions. For 
these the leading coefficient is of the two-particle operator $7. 


10.5 Renormalization group 


To make calculations for a Wilson coefficient, we must use the 
renormalization group to obtain maximum information from a low-order 
calculation in the way we will now explain. 

The coefficient has the functional dependence 


C(x?, g, u). 
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To use this we must set u” to be of order |1/x?| to avoid large logarithms, 
just as in calculating Green’s functions when all their external momenta get 
large (Chapter 7). The RG equations are simple, since, to the leading power, 
there is no mass dependence. (If we use non-leading powers of x then we 
have polynomial dependence on masses.) The renormalization-group 
equation for the Wilson coefficient can be derived most easily from the 
renormalization-group equations for Green’s functions in the following 
fashion. 
In the @* theory we have the renormalization-group operator: 


d ð ð „ô 
— = p— + p—- - 5. 10.5.1 
Pap ar ta Pa Ym om? ( ) 


The renormalization-group equations we need are 


d 
ari = —(N/2 + 1)yGR32’, (10.5.2) 


d 
ne Gul L971, JO" = (y, —INy)Gy([b?],...)". (10.5.3) 


Here, GÇPP denotes the Green’s function of N +2 external ¢-fields, 
restricted to connected graphs. G,([7],...)©°"" denotes the connected 
Green’s function of the renormalized [¢*] operator with N ¢-fields. To 
derive (10.5.3) we used the fact that 
m’ [p°] = mp? + coefficient times 1, 
and m = Z,m’. 
We apply the operator pd/du +(N/2 + 1)y to both sides of 
COITOS)... (Pn) |O> 

= Cy2(x)<0| TL ¢7]b(p1).-. d(Py)|0> + O(x”), (10.5.4) 

to obtain 


dC,2 2 ~ ~ 2 
O=| wg tt Cg (0|T[$*] $(P,).-- d(Px)|O> + OX). 
(10.5.5) 
Since the O(x?) terms are order x? independently of g, m, and u, they 


remain of order x? after ud/dp is applied. (As usual, ‘order x?” means order 
x? modulo possible logarithms.) Immediately we obtain 


d 
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This equation can be solved — it is effectively in the m=O theory. The 
anomalous dimension of C,. is the anomalous dimension of ¢(x)¢(0) minus 
the anomalous dimension of [@7]. This is the same relation as for the 
engineering dimensions. The signs in (10.5.6) arise from peculiarities of our 
definitions of y,, and y. 


11 


Coordinate space 


In the previous chapters we set up renormalization theory in momentum 
space. In this chapter, we will give a treatment in coordinate space. Now, the 
utility of a momentum-space description, such as we gave in the earlier 
chapters, comes from the translation invariance of a problem. However, the 
momentum-space formulation rather obscures the fact that UV diver- 
gences arise from purely short-distance phenomena. Thus a coordinate- 
space treatment is useful from a fundamental point of view. There are also a 
number of situations, essentially external field problems, where a 
coordinate-space treatment is the most appropriate from a more practical 
point of view. A particular advantage is that the coordinate-space method 
makes it easy to see that the counterterms are the same as with no external 
field. 

An important case, which we will treat in detail in this chapter, is that of 
thermal field theory at temperature T (Fetter & Walecka (1971), Bernard 
(1974), and references therein). There one works with imaginary time using 
periodic boundary conditions (period 1/T). 

It is first necessary to work out the short-distance singularities of the free 
propagator. This we will do in Section 11.1. A number of forms of the 
propagator will be given, whose usefulness will become apparent when we 
treat some examples in Section 11.2. The reader should probably skip to 
this section first and refer back to Section 11.1 as the need for various 
properties of the propagator arises. We will explicitly show that the 
particular counterterms that we compute are independent of temperature. 
Our arguments will be of a form that will readily generalize not only to 
higher orders but also to other situations. 

In Section 11.3 we will show to all orders of perturbation theory that 
counterterms at finite temperature are independent of T. We will do this by 
constructing counterterms in a theory in flat space-time in such a way as to 
make manifest the fact that only the short-distance singularities of the 
propagator affect the counterterms. 

Another case, which we will treat in Section 11.4, is flat space-time with, 


277 


278 Coordinate space 


Say, an external electromagnetic field. If the field is weak it can be treated as 
a perturbation, i.e., as part of the interaction. But if the field is strong, one 
must put it in the free Lagrangian. Thus the free electron propagator 
satisfies 


(id + eA — M)S,(x, y;eA) =10(x — y), (11.0.1) 


which cannot in general be solved by expanding in a series in the field 
(Schwinger (1951)). 

Another common external field problem is that of quantizing a quantum 
field theory in a curved space-time, where there is no remnant of a global 
Poincaré symmetry. Momentum space is then an inappropriate tool. One 
wishes to show that the UV counterterms can be kept the same as in flat 
space-time. We will not discuss this particular case here. But an extension of 
the techniques described should enable a fairly simple treatment to be 
given. 


11.1 Short-distance singularities of free propagator 


11.1.1 Zero temperature 


The UV counterterm of a 1PI graph is ultimately determined by the short- 
distance singularities of the free propagators that make up the graph. So we 
need to obtain these singularities; and we will do this in this section. 

It is sufficient to consider the scalar propagator 


d‘q jeit * 


2. _ 
SRE ea) (27)? (q? — m? + ie) 


(11.1.1) 


Propagators for fields with spin can be obtained by differentiating with 
respect to x. The propagator satisfies the equation 


(O + m?)S,, = —i5(x), (11.1.2) 


together with appropriate boundary conditions. When x is non-zero this 
equation reduces to 


(426?/dz? + 2dd/dz — m?)S, =0, (11.1.3) 


where z = — x”. Thus Sp = z+% “/*w(mz'/), where w satisfies the modified 
Bessel equation of order y=d/2—1. The particular solution we need is 
determined by requiring that the 6-function in (11.1.2) be obtained at x? = 0 
and that S, 0 as x? > — œ. 
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A standard method of solving (11.1.3) is to expand in powers of z: 


S= ¥ (zm’)"w, 
n=0 
=(— x?)*)\(— m’x’)"w,,. (11.1.4) 
There are two independent solutions. One is analytic at z = 0, i.e., it has 
a=0; the other solution is singular, with a= 1 — d/2. Sp is a linear 
combination of these solutions. The quickest way to compute the 
singularity is to observe that, if m = 0, then (11.1.1) gives 


Sp(m = 0) =T (d/2 — 1)/[4n7/7( — x?)*? 1]. (11.1.5) 
This normalizes the coefficient of the singular solution. The normalization 


of the regular solution is obtained from the properties of Bessel functions, 
by requiring that S,—0 as x” — oo. Then we find that 


1 as (“) T(d/2—-1—n) 


Se= gan apt 2G 7 


+ mi? (4n) 4/2 5 (m?x?/4)"T(1 — n — d/2)/n! 


= SFsing + S Fana’ (11.1.6) 


Here, we have used the series expansion of the Bessel functions. The 
coefficient of the regular solution can be obtained quickly by explicitly 
computing, from (11.1.1), that Sp = T (1 — d/2)m*~ ? /(4r)"? at x =0, if d is 
less than 2. 

For the purposes of renormalization we will need the singularities of S,. 
These are of two types: the singularities of Spsing as x0, and the 
singularities of Sr na as d approaches an even integer. The fact that most of 
the T -functions have poles when d is an even integer d = 2%, reflects the fact 
that the ansatz (11.1.4) is then incorrect for the singular solution. However, 
the correct result is obtained by expanding in powers of d — 2w and then 
letting d > 2m. The limits x > 0 and d > 2m are non-uniform. Application to 
graphs like the ° self-energy need x+0, but tadpole graphs (e.g., 
Fig. 11.1.1 in ġ4 theory) have a propagator with x = 0 for all values of d. So 


Fig. 11.1.1. Graph with a tadpole. 
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it is convenient to extract all the singular behavior of Sẹ, close to d = 2m and 
x = 0 by writing 


Sr oT Srca + Spa 


- 1 o-2 (m?x?\" T(d/2 — 1 — n) 
S Fsing = gro — x20 T L aren ee e 
Jal 


I (d/2 — 1 — n) i 
i (= nx?) e (d/2—w)(nt1—q@)! |f 


n 
ts mee 2 X) m2 x2 n] 
Sae ar e 
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Neither term is a solution of the equation for Sp. The analytic part now has 
a finite limit as d—>2%w, with no singular behavior at x = 0. The singular 
term also has a finite limit as d—2m if x #0. It contains the x-0 
singularities. But if we need S,.(x = 0) then we first take d <2, then x +0, 
and finally d— 2m. Then the only singular contribution is from the n= 
œw — 1 term in Syeing: 

(= 1)°m2°~ 2 yT? 


(d/2 — w)(4n)°(w — 1)! aah 


Seance = 0) == 


This result will be needed in evaluating tadpole graphs. 


11.1.2 Non-zero temperature 


Thermal Green’s functions at inverse temperature B = 1/T are obtained by 
Wick-rotating time t— — itr and then by imposing periodic boundary 
conditions (Fetter & Walecka (1971)). Thus the free propagator 
S(x — y; B) satisfies 


(— 62/ét? — V? + m?)S,-(t, $) = 6(1)6)(X), (11.1.9a) 
S,(t,X)—-0, asx, (11.1.9b) 
Sp(t + B, X) = S(t, X). (11.1.9c) 


All integrals over space-time are restricted to the time range between 0 and 
$. The propagator is obtained from the zero-temperature propagator 
S(x; 00) by writing 


Susp S Saa (11.1.10) 
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(Note that in Sp(x — y; B) both t, and t, are between 0 and $, so when we 
apply — ĝ?/ðt? — V? + m° to it the only ô-function in (11.1.9a) arises from 
the n = 0 term.) 

The only singularities in S(x; 8) are from the n = 0 term when x” = 0, so 
we have 


Sr(x; P) = Speing(*) + Srana(X; B), (11.1.11) 
where S;.ing is the same as in (11.1.7), with — x? = t? + x”. The analytic 
term is different. It differs from S,.,,,(x; 0) at zero temperature by 


Fana 


AS, = > Spelt + nB, x; 0), 
n+0 

which, in the range 0 < t < f, is a solution of the homogeneous equation 
(0O + m’)S,. = 0. Notice that it is not a function of x? alone, i.e., it is not 
Poincaré invariant. | 

Another, direct, way of deriving the form (11.1.11)is to expand S,(x; 8) in 
a power series about x” = 0, and then to require the differential equation 
(11.1.9a) to be satisfied. The 6-function on the right-hand side ensures that 
Sy = Spsing(X) + Spana(x; f), where Spsing Is the same singular solution as in 
(11.1.6). Then Srana is a solution of the homogeneous equation analytic at 
x =0, and for which Sing + Spana Satisfies periodic boundary conditions. We 
then add and subtract the poles at d = 2w to obtain (11.1.11). The coefficient 
of the pole in Spana is the same as at zero temperature, because it must cancel 
the pole in S which is independent of temperature. 


Fana 


Fsing? 


11.2 Construction of counterterms in low-order graphs 


To explain how renormalization works in coordinate space we consider the 
graphs shown in Figs. 11.2.1 to 11.2.4 for t theory in four space-time 
dimensions. These are sufficient to show the various complications that 
occur. 

Our treatment works as well at any temperature. 

The simplest example is the one-loop correction to the propagator, 


x z y 
(a) (b) 
Fig. 11.2.1. Self-energy graph and counterterm. 
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Fig. 11.2.1. Its contribution to the full propagator is: 


1 
M,= -> + {ateSy(s — z,d;1/T)S,(z — y,d;1/T)S,(0,d;1/T). 


(11.2.1) 
The functions S,(x — z) and S,(y — z) are singular at z = x and at z= y, but 
the singularities are integrable, so that they do not contribute any 


divergence. The factor S,(0,d;1/T) is divergent, so we use (11.1.8) and 
(11.1.11) to write M, as 
i Z 
M, = -4 [ates — Z) S (z — Y)S fana (0, d = 4;1/T) 


2 


a EE E +0(d-4). (11.2.2) 


m 
8n? (d — 4) 
The divergent term is evidently exactly cancelled by the mass- 


renormalization counterterm, Fig. 11.2.1(b). The result for the renorma- 
lized propagator at order g is 


J Z |atesetx Z z)Sg(z = Y)S kana (0, 4, 1/T). (11.2.3) 


Notice that the renormalization only involved the singular term in Sp. Thus 
we have shown that the counterterm is the same at any temperature. 
The unrenormalized graph of Fig. 11.2.2 for the four-point function is 


M, = 2( = int? [ayatss 5, — y)Se(x2 — y)Se(y — z)? x 


x S,(z — x3)S,(2z — x4). (11.2.4) | 


The only divergence as d approaches 4 comes from the singularity of 
S,.(y — z}? at y =z; it is logarithmic. Let us write 


Msg [etvatza5( — z)*f (y, z,{x;}). (11.2.5) 


Since the divergence is logarithmic, it is governed by the leading behavior of 
x; X3 
, ~ xX X 
(a) (b) 
Fig. 11.2.2. Vertex graph and counterterm. 
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the integrand of (11.2.5) as yz. Thus we can generate a counterterm, 
illustrated in Fig. 11.2.2(b), 


M,= — [atratea'sl y, {x;}) [pole part ot | d’zSp,(y —2z)’]. 
oe (11.2.6) 


The singularity is entirely given by the first term S,, in the expansion 
(11.1.7) of 5 


Fsing 


Se, = (d/2 — 1)/[4r"?( — x7)477?]. 
The pole-part factor in (11.2.6) is then 


T (d/2 — 1)? 
- pole} | d“2S yah - poled Ea | dte(— 27-4 
z™y . 16x 2~0 


: 2 d/2 
= Poe i dzz?~4| 
16n T(d/2) lo 


4 


O Oa 
 8n24—d 
In the first line we shifted z to z + y. In the second line we Wick rotated z°, 
and the notation |, _ „dz indicates that only the region near z = 0 matters. 
The factor p4~* in the last line is needed, as usual, to preserve explicit 
dimensional correctness. 
We thus find that the renormalized M, is 


(11.2.7) 


M, = lim |d4zd‘y} g*S (pp TE Bai) x 
d>4 j 8n? (4 — d) 


| x f(y, 25 {x;}). (11.2.8) 
The factor in square brackets is a well-defined distribution at d = 4. Again 
observe that we only used the singular part Spsing of Sp in obtaining the 
counterterm, so that the counterterm is temperature-independent. 

The graph Fig. 11.2.3 has a logarithmic subdivergence as well as an 


Xı y X3 
X i) XX 
X2 X4 

(a) (b) 


a (c) 


Fig. 11.2.3. Two-loop vertex graph and counterterms. 
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overall divergence. Its unrenormalized value is 
M, =3(—igpu*~ 4) [atwatyatzsete — w)S,(x, — w)S,(x3 — y) x 
x Sp(X4 — Z)Sp(w — y)Sp(w — z)Sp(y — z)? 


= 9° | dwd yd4zf (Ww, y, z, {x:})Sp(w — y)Sp(w — 2)Sp(y — 2)”. 
(11.2.9) 


If w is not close to y or z there is a logarithmic divergence when y 
approaches z. This is identical to the divergence in Fig. 11.2.2. There is also 
an overall divergence when all of the interaction points w, y, and z approach 
each other. 

We first subtract the subdivergence by the counterterm (11.2.7): 


R(M;) = g? [Fw Yy, Z, {x;})Sp(w T y)Sp(w = Z) x 
i ui-4 

x | S.(y — 2? + — Py — 2) |. 11.2.10 
The distribution in square brackets is singular at y =z. However, it is 
integrable; that is, it produces a finite result when d goes to 4, if it is 
integrated with a test function continuous at y=z. The function 
f® (w, y, 2)Sp(w — y)Sp(w — z) is continuous at y = z, unless also w = y =z. 
The remaining divergence in Ř(M,)comes from the region w ~ y ~ z, where 
f® is not continuous. (There is also a singularity if y and z are both equal to 
x, OF x4; however, this region is again integrable.) The divergence at w = 
y=z is logarithmic, so again it is determined entirely by the leading 

singular terms of the propagators. The counterterm is therefore 


C=- fawo ww w, (x) pole] [atvasSy 0Sa x 


x [s02 toe Ji (11.2.11) 


as shown in Fig. 11.2.3(c). Recall that the factor in square brackets is 
singular but integrable at y = z. So the only divergent behavior comes when 
Sp(y)Sp(z) is singular, i.e., at y=z=0. It is easily checked that the 
counterterm C then reduces to 


— 72,,24- 8 2 1 
C= fanm wm D een || amar a (11.2.12) 


just as in momentum space (Vladimirov, Kazakov & Tarasov (1979)). 
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(a) (b) (c) 
Fig. 11.2.4. Two-loop self-energy graph and counterterms. 


Our final graph, Fig. 11.2.4, is a little more difficult. Its unrenormalized 
contribution to the propagator is 


M, =5(— igu’) [atratesetx, — y)Sp(y — z)°S_(z — X2) 


= p latyates My z;{x,})S,(y — Z}. (11.2.13) 


The difficulty is that in momentum space there is not only the quadratic 
overall divergence, but also three logarithmic subdivergences. However 
(11.2.13) appears to have only an overall divergence, when y goes to z; this 
involves all three propagators. Furthermore, the counterterm, calculated in 
momentum space, for a subdivergence from two of the lines gives a graph of 
the form of Fig. 11.2.1(a). There the third propagator is at y = z, whereas the 
propagators in (11.2.13) are also used at y # z. 
The correct way to handle this case is to write 


Sr = Sry + Sp + S (11.2.14) 


rem? 


where S+, is the same leading term as before, and Sp, is the second term in 


K 


Fsing . 


27 T@2-2 4-4 
Se (x)= -ea amn -| (11.2.15) 


The remainder S,em(y — z) is finite as d > 4 and/or y > z. We now substitute 
(11.2.14) for each factor of S,(y — z) in (11.2.13) and obtain 
M, = 7) | d'yd’zf (y, z)Sp,(y — z)° + 3 | (y, 2) SF Spe 


+ 3 fre% 2S. Siem} + finite. (11.2.16) 


The term with all S,,’s has the quadratic divergence, while the terms with 
two Sp; s are logarithmically divergent. The other terms are finite. The term 
with Sž; S,em has a divergence coming from the two S,,’s, and its factor 3 
reflects the fact that there are three divergent subgraphs. The graph with the 
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counterterm for the subdivergences is Fig. 11.2.4(b): 
4-d 


4b — 8x4 d) d*yf Oy, y; {X;})S,(0) 


er, d*yf Oy, y; {x;})[S,2(0) T Sem(9) J. (11.2.17) 


Here we used the value we calculated at (11.2.7) for the counterterm vertex. 
The S,,,,(0) term here cancels the corresponding divergence in the third 
term in (11.2.16), while the Sp, term combines with the second term in 
(11.2.16) to produce a logarithmic divergence from 


39° | dyf Cy, »i | dz Spi (y — 2)? Saly — zyme * 
z=y 


ue 
-wai 5500} (11.2.18) 


Let us return to (11.2.16). The divergence in the first term is sensitive to 
f™ and its second derivative at y =z: 


g? fef d’ySpi ly — PLS O, z) + (y — f ay”), 


+ Hy — z}"(y — 2)"(0°F  /dy*dy")|, -2] 
= finite + [2ig?x4/?/T'(d/2)] x 


T/2- 1) y aro 
déz yt} (4) 
«| |, (antes | F2 -ayol 


= finite — ( 2) | df yd’zf (y, z) 6(y — z)/(8 — 2d). (11.2.19) 


Note that the term with f(z, z) gives a pole at d = 2 but not at d = 4. 


11.3 Flat-space renormalization 


In the last section we computed counterterms for some low-order graphs. 
Our method was not a good method for computing the finite parts. But it 
made very explicit the fact that the counterterms depend only on the short- 
distance structure of the free propagator. In particular it made it obvious 
that the counterterms are independent of temperature. We must now spell 
out how to generalize the results to an arbitrary graph in an arbitrary 
theory. 

We define the renormalization of a graph G by the same structure that we 
had in momentum space (i.e., by the recursion method or by the forest 
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formula). A counterterm is needed for the overall divergence of every 1PI 
subgraph. The overall degree of divergence of a 1PI subgraph is obtained by 
first counting powers of position as all of its vertices approach each other, 
and by then taking the negative of the result (since it is an x > 0 divergence). 
The integrations over relative positions of the vertices are included in the 
power-counting. The result coincides with the usual momentum-space 
definition for a 1 PI subgraph. The value of Gis obtained by integrating over 
positions of its interaction vertices, and the divergence and subdivergences 
come from regions where some of these positions approach each other. To 
each region corresponds a subgraph consisting of the vertices that go to the 
same point, together with all the lines joining them. 

It might appear that these graphs should be in one-to-one cor- 
respondence with the 1 PI subgraphs that in momentum space are divergent 
when all their internal loop momenta are large. However this is not so, for 
the following reasons: 


(1) There are graphs overall divergent in coordinate space that are not 1PI. 

(2) There are divergent subgraphs in momentum space that are not 
obtained directly in coordinate space; these are when some but not all 
lines connecting vertices of a 1PI graph are in the subgraph. 


An example of the first case is Fig. 11.3.1, where there is a logarithmically 
divergent graph consisting of the vertices at w, y, z, and of the lines joining 
them. In momentum space the counterterms for the self-energy subgraph 
cancel all the divergences. But in coordinate space Sp(w — y) is singular, so 
we have to justify the momentum-space result. Other graphs are divergent 
in momentum space but do not occur directly as divergences in coordinate 
space. A typical example is given by the one-loop subgraphs of Fig. 11.2.4. 
The divergence comes from the region y >z, and it appears to involve all 
three propagators, never just two propagators. As we saw in Section 11.2, 
the trick to handling this problem is to make a decomposition of the 
propagator, as in (11.2.14). 


Fig. 11.3.1. The w—y—z subgraph is one-particle-reducible and apparently has an 
overall divergence in coordinate space. 
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First, however, let us consider the problem of divergent 1PR graphs like 
the (w, y,z) subgraph of Fig. 11.3.1. After subtraction of the self-energy 
counterterms, Fig. 11.3.1 has the form 


[twatyatar St —y)x 


x [S.(y — z} + Ad(y — z) + BO ô(y — z2)]Sp(z — x3) (11.3.1) 


Here f(w) is a function non-singular at w = y. The factor in square brackets 
is the self-energy plus counterterms. At d=4, we have S,(w-— y)~ 1/ 
(w—y)* and S,(y —z)? ~1/(y —z)®, so it might appear that there is a 
logarithmic divergence when w and y both approach z. A counterterm for 
this divergence would be a four-point vertex and therefore allowed. 
However, the momentum-space result gives no such counterterm, so it is 
important to derive the same result directly in coordinate space. 
We first integrate over w, and see that 


| dase 


is finite and non-singular as a function of y. Then we can perform the y- 
integral. Although S,(y — z) is singular at y =z, it is integrated with a 
function with no singularity there, so the counterterms are sufficient to 
cancel the divergence. Then, finally, we integrate over z. The crucial point is 
that we find an order of integration with the following property: — each 
integral kills the singularity on precisely one propagator without introduc- 
ing new singularities. If we replaced Fig. 11.3.1 by Fig. 11.2.3, say, we would 
have a not-quite similar integral: 


[atwatyat Sp(w — y)Sp(w — z)[S,(y — z} — counterterm ]. 


The integral over w, for example, involves two propagators S,(w — y), 
S;(w — z). The result becomes singular at y =z. 

So far we have chosen to define renormalization by the forest formula or 
by the recursive method. We have seen that, after subtraction, only 1PI 
graphs have divergences. Some subgraphs that need counterterms in 
momentum space (like Fig. 11.2.4) do not appear explicitly as divergences 
in coordinate space, since we defined the subgraph corresponding to a 
coordinate-space divergence to include all the lines connecting its vertices. 
Even so, we saw in Section 11.2 that we have counterterms for such 
subgraphs — as in Fig. 11.2.4(b). To show how these counterterms arise in 
general and to show that the counterterms are independent of temperature, 
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we decompose Sp as 
Sr = Sri +e + SEN + Se (11.3.2) 


Here Sp, to Spy are the first N terms in the singular part S,.;,, of Sp. We 
choose N so that 2N is larger than the degree of divergence of any subgraph 
of the graph G that is being renormalized. Thus when S, is replaced by S 
any (sub)graph y containing it becomes overall convergent. 

When we substitute (11.3.2) for every propagator in a graph, we obtain a 
sum of graphs, in each of which every propagator is replaced by one of 
Srio--+> Sen, OF Srem. We renormalize each of these new graphs separately. A 
subgraph needs acounterterm if its overall degree of divergence is a positive 
integer or zero, and we make counterterms for all 1PI subgraphs that are 
divergent. 

The rules for computing the degree of divergence are essentially obvious. 
Any propagator with S, replaced by Sp, contributes as in the original 
graph, and a replacement by Spy reduces the degree of divergence by 
2N — 2. The only subtlety is that replacement of S,, by S,..,.(x — Y) is 
considered as contributing — 2N to the degree of divergence, even though it 
does not behave as (x — y)*~4*?% when x — y0: Its analytic part behaves 
as (x — y)°. However, the analytic part never actually contributes to any 
overall divergence, and the singular part of S,,,, does go like (x — y)?~7*?% 
relative to Spi- 

A typical case is the third term on the right of (11.2.16). Before subtraction 
of the subdivergence, this term is proportional to 


rem? 


rem 


fy Hy, Z)S eml Y — Z)Spy(y — 2)?d*yd4z 


Now Spi (y — 2)? ~ (y — z)*” 24 as y > z, while S,,. > SFana (0) # 0. So there is 
a logarithmic divergence at y = z. But after subtraction, we have: 


. d—4 


[row 2)Srem(¥ — al Sel ta = reer yaa z) Jaya 


When integrated with an analytic function like f(y, z)S,,,,(y — 2), the 
factor in square brackets gives a finite result. So any divergence comes from 
the singularity in S,,,,. But the singularity is of order (y—z)°~% so no 
divergence occurs. 

We now generalize our treatment of Fig. 11.2.4. We examine structures of 
the form of Fig. 11.3.2(a). The subgraph A is 1 PI and overall divergent. The 
lines |, ,...,1,join vertices of A. A graph that consists of A and some or all of 
the /,’s is 1 PI and is overall divergent. In coordinate space, these divergences 
come from the region where all vertices of A approach the same point; it 
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Fig. 11.3.2. General structure where subgraphs are divergent in momentum space, 
but appear to have no corresponding divergence in coordinate space. 


appears that the lines /,,...,/, all participate in every one of the divergences. 

In order to preserve the correct counterterm structure we must include 
counterterms for graphs of form A in our definition of renormalization. 
This ensures that counterterms are given by terms in the action. For 
simplicity consider the case in @* theory of one line l with two external lines 
Pi» P2. Schematically we have 


[twatatyatss( — w)A(w, x, y, Z) f(y, Z). (11.3.3) 


Here S,(x — w) is the propagator of line l, while f(y, z) is analytic at y =z 
and represents the rest of the graph. A(w, x, y, z) is the value of graph A. We 
subtract off all subdivergences of the graph Av {1}. Among these is the 
overall counterterm for A, which is of the form 


C,(w, x, y,234, g, p) = C(d,g, u) P(w — 2)5(x — z)6(y — z), (11.3.4) 


since A gives a logarithmic divergence. The result of replacing A by its 
counterterm is then 


C(d,g, u) | d4zf (z, z)S,-(0). (11.3.5) 


This counterterm is necessarily generated by the Feynman rules, since when 
A occurs in a four-point Green’s function without additional loops it needs 
an overall counterterm. We wish to show how this counterterm is needed to 
cancel certain parts of the divergences of (11.3.3) as yz. 

Let us now substitute each propagator by (11.3.2) with N=2. The 
propagator for line Imay be replaced by Sp,» S,., or S em: The terms with Sp; 
and Sp, both contribute to a divergence at y = z, whether in the original 
graph (11.3.3) or in (11.3.5). The divergence from (11.3.5) with Sp = Sp; or 
Sr2 is local and temperature-independent, so that it can be cancelled by 
overall wave-function and mass counterterms. The terms with S em also give 
divergences, but cancel in the sum of (11.3.3) and (11.3.5), just as in the 
simplest case of Fig. 11.2.4. 
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The argument presented above is considerably more cumbersome than 
needed for the particular case considered. However it was presented so as to 
emphasize its form as a special case of a general argument. This case 
happens to be the only one present in Green’s functions. However in the 
vacuum bubbles (used to compute ground-state energies) and in the 
presence of composite operators the degree of divergence can be higher, 
with the consequence of needing a more general proof. 

In any event, the moral is that if we choose counterterms to cancel 
divergent subgraphs including those of form A in Fig. 11.3.2, then the 
counterterms need only depend on the S,,, S;,.... In particular, the value 
of Srana Is irrelevant. It is only Spana that knows the boundary conditions, so 
only it knows the temperature. Note that the Sp;s are monomials in mass. 
Hence our counterterms are polynomials in mass parameters, as we saw by 
a totally different method in momentum space. 


Fana 


11.4 External fields 


Consider as an example QED in the external electromagnetic field 
generated by a classical source J”. We have (in covariant gauge) 
P= —4Fi,+ Wligt+ eA— M)y -— J'A, 
—4(Z3 — 1) Fi, + (Z2 — IY lið + eA) — py (Z,Mo — M) 
— (1/2€)d-A?. 
(11.4.1) 

To separate the classical and quantum parts of the electromagnetic field, we 
let J, be a c-number potential that satisfies the classical Maxwell equations 
with source J”. Then we replace A, in (11.4.1) by 4, + £ „ with the result 


L = — 4 F7, —(1/26)d-A? + Wid + e4- M)y + epAy 
—4(Z; — 1)Fiy + (Z3 — IY (i+ eA) — (Z Mo — M)ýy 
thot J" +A(Z, 1), J" + (Z; — AJ” 
+ (Z, — 1)West + total derivative. (11.4.2) 


We assume the gauge condition ô: = 0. The total electromagnetic field is 
A, + £, with the classical c-number part satisfying 


0:% =0 
D4 = J". (11.4.3) 
If the field £, is large then we are not allowed to expand in powers of 


Á, Indeed, as Schwinger (1951) pointed out, the electron propagator 
S(x, y;ex) in the external field is not necessarily analytic at æ = 0. 
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Moreover, he showed that it is the non-analytic part that is relevant for pair 
production in an electric field. We will therefore do perturbation theory in e 
without assuming that e.?, is small. The lowest-order propagator (in 
powers of e) therefore satisfies the equation: 


(iy"0/Ox" + ey", — M)S,(x, y;e) = id O(x — y)1. (11.4.4) 


Since Sp is no longer simple in momentum space, it is not clear that the 
renormalizations are the same as with .o/, = 0. Let us work in coordinate 
space. We construct a power series in x — y to solve (11.4.4). The solution is 
a series singular at x = y, plus a series analytic there. We only need explicitly 
the first few singular terms, and to prove renormalizability we decompose 
Sp as in (11.3.2). It is important not to use an infinite series for Sp at that 
stage, because the series will not converge for general values of x, y and £. 
The singular series is obtained by treating both e and M in (11.4.4) as 
perturbations and expanding in powers. This is similar to the way in which 
the singular part of the scalar propagator is an expansion in powers of m°. 
We obtain the leading power of x — y by solving 


iy"0/Ox"S,., =id(x — y)1. (11.4.5) 


Fach non-leading term S,., is obtained in terms of earlier terms by solving at 
x # y the equation 


n-1 


iy"0/0x"S,, =(M — ex y") $ Sr; (11.4.6) 
j=l 


To obtain Sp,„ uniquely, we require it to be a singular power of x — y times a 
function of y. The operation iy“d/dx" makes Sp, more singular while 
multiplication by M or by ey"#, leaves the degree of singularity 
unchanged. 

After this work we see that the renormalizations are correctly given by 
treating as a perturbation the term J*A, in the Lagrangian (11.4.1) before 
the shift of the field to A , + £, Since A, is the renormalized field, this term 
cannot affect the divergences — it simply tells us to integrate J“(x) separately 
with each of one or more external photon fields of an ordinary Green’s 
function. 

We see that, after the shift to A, + £, the divergences are correctly 
treated by expanding in powers of . and then taking the first few terms. 
The non-analyticity is entirely confined to the remainder term in S+; this 
contributes to important physics, but not to the divergences. 
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Renormalization of gauge theories 


It is important to show that renormalization of a gauge theory can be 
accomplished without violating its gauge invariance. Gauge invariance is 
physically important; among other things it is used (via the Ward 
identities) to show that the unphysical states decouple (t Hooft (1971a)). 

In Chapter 9 we considered the case that the basic Lagrangian of a theory 
is invariant under a global symmetry, as opposed to a gauge symmetry, 
such as we will be investigating in this chapter. We showed that the 
counterterm Lagrangian is also invariant under the symmetry. Suppose 
now that the basic Lagrangian is invariant under a gauge symmetry. One 
might suppose that the counterterms are also invariant under the 
symmetry, just as for a global symmetry. This is not true, however, since the 
introduction of gauge fixing (as explained in Sections 2.12 and 2.13) destroys 
manifest gauge invariance of the Lagrangian. One might instead point out 
that the theory with gauge fixing is BRS invariant and deduce that the 
counterterms are BRS invariant. This deduction is false. To see this, we 
recall that an ordinary internal symmetry relates Green’s functions with 
certain external fields to other Green’s functions differing only by change of 
symmetry labels. However, BRS symmetry relates a field to a composite 
field (2.13.1). This wrecks the proof of BRS invariance of counterterms 
except in an abelian theory, where the Faddeev—Popov ghost is a free field. 

Before treating the non-abelian case, let us examine an abelian theory, 
QED. The Lagrangian is 


L = —4F i, + Wid + er A- My 
— 4(Z3 — 1)F iy + (Z2 — IlG + erd — M)yp 
— Z,W(M, — M)w — (1/2€)d- A?. (12.0.1) 
Here eg and M are the renormalized coupling and mass, while 4,, y and ọ 


are the renormalized fields. We have chosen to include only counterterms 
invariant under the gauge transformation 


y > eaey, 
Y > e`isop, (12.0.2) 
A,7A,+ 0,0%. 
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A special case of our later results will show that coefficients of the possible 
gauge non-invariant counterterms actually vanish. (We will write eg = 
u? ~’? e if dimensional regularization is used, with e being the dimension- 
less renormalized charge.) In terms of unrenormalized quantities A? = 
Z? A Wo = Z}? W, eo = Z3 ep, and éo = EZ, we have 


f=- 4(0,A oy ai 3 Aou) + Wolid + eoåo — Moo — (1/2€,)0- AG. 
(12.0.3) 


The counterterm for the photon—electron vertex is, from (12.0.1), 
(Z, —lepW Ay, (12.0.4) 


which has a coefficient proportional to the wave-function renormalization. 
It is easy to verify that to one-loop order this result is correct. Our later 
results prove it to all orders. 

If the non-abelian theory of (2.11.7) and (2.11.8) were renormalized by 
gauge-invariant counterterms, ther the same relation of the Y Aw counter- 
term to the quark field strength renormalization Z, would hold. It is easily 
verified by explicit calculation that this is false. As we pointed out above, the 
reason that the counterterms need not be BRS invariant is that the BRS 
invariance is not a symmetry that relates elementary fields to elementary 
fields. Even so, the counterterms are such that the Lagrangian is BRS 
invariant after renormalization, but under a renormalized BRS symmetry, 
as we will show in this chapter. 

One result will be that whereas the gauge transformation of a fermion 
field in QED is given by dw = iegwy, with eg being the renormalized charge, 
the BRS transformation of a fermion field in QCD is ôy =iggc*t*X pod, 
where X is a divergent renormalization factor. The composite operator ôy 
is finite. 

There are a number of strategies for proving renormalizability. Before 
explaining them, let us remark that the aim is to show that a finite theory 
exists which has gauge-invariance properties. The gauge invariance is 
exhibited by Ward identities. It is possible to choose counterterms in such a 
way that gauge invariance does not hold. For example, we could add a term 
(AA D: to the QED Lagrangian. Since the coefficient of this term is 
dimensionless, we obtain a finite theory by adding appropriate extra infinite 
counterterms. But this theory is not gauge invariant. This implies, for 
example, that negative metric states do not decouple from physical 
processes, and the theory is unphysical. The proofs state that it is possible to 
choose counterterms so that gauge invariance holds. 

Several approaches can be distinguished : 
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1. Invariant regulator. Use an ultra-violet regulator that does not break 
gauge symmetry, for example dimensional regularization. Then Ward 
identities are true when the Lagrangian is given by (2.11.7), (2.12.5), and 
(2.12.9). Allow all parameters and fields to get renormalized. The theorems 
to be proved are that this is sufficient to make the theory finite when the 
regulator is removed. This is the traditional approach. The advantages 
center around the manifest preservation of gauge invariance. The disadvan- 
tages are that chiral symmetries cannot be regulated gauge invariantly; this 
symptomizes the fact that not all chiral symmetries can be preserved after 
quantization — see Chapter 13. 


2. Gauge invariant regulator + MS. Let us again use dimensional reg- 
ularization (or another gauge-invariant cut-off). But now let us choose to 
renormalize each separate graph by the forest formula or by the recursive 
method (as given in Chapter 5). We do not explicitly constrain the 
counterterms to satisfy gauge invariance; so in general we have violated 
gauge invariance. But if we use minimal subtraction then the counterterms 
are gauge invariant. The reason is simple: since we will prove that we can 
renormalize the theory gauge invariantly, the lowest-order counterterm 
that is not gauge invariant must be finite, after summing over all graphs of 
this order. But in minimal subtraction the only finite counterterm is zero. 
This method is of great use when renormalizing the complicated non-local 
operators that appear in generalized operator product expansions (Collins 
& Soper (1981)). We can renormalize the graphs without explicitly 
investigating the Ward identities. The disadvantage is that the method is 
closely tied to a specific renormalization prescription. 


3. Non-invariant regulator plus non-invariant counterterms. One can use 
any regulator and adjust overall counterterms, if possible, to satisfy all the 
Ward identities (Piguet & Rouet (1981), Symanzik (1970a), °t Hooft (1971a), 
and Piguet & Sibold (1982a, b, c)). 


There are two different forms of the Ward identities, either of which may 
be used. There are the Ward identities derived in Section 2.13 for Green’s 
functions, and there are the ones for the 1PI graphs, as derived by Lee 
(1976). Our approach will use the BRS identities for Green’s functions 
together with a combination of approaches 1 and 2. It is based on the 
treatment of Brandt (1976). Most other treatments have used the identities 
for the 1PI graphs. 

We will restrict our attention to the simplest theories, like QCD. More 
general cases — with chiral or supersymmetries — are not treated here. See 
Chapter 13 for references. 
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12.1 Statement of results 


For simplicity we will mainly treat one case: a theory of a gauge field 
coupled to a Dirac field, with the gauge-fixing term being the usual 


one, — 5 (3: A? (28). We assume that the gauge group is simple (in the 


mathematical sense); physically, this implies that there is only one 
independent gauge coupling. With a U(1) gauge group and one or more 
Dirac fields, this theory is quantum electrodynamics. If the gauge group is 
SU(3) and the matter fields are in the triplet representation, then we have 
quantum chromodynamics. 

The result to be proved is that the Green’s functions are made finite by 
renormalizing the values of all the parameters in the basic Lagrangian 
(2.11.7). These parameters are the gauge coupling, the fermion masses M, 
the field strength renormalizations, and the gauge-fixing parameter. 

The resulting Lagrangian expressed in terms of renormalized fields is 


L= —4Z; Giv + » ZU GD — MDW; 
— (1/2£)d- A”? + Z0,6°D"C,. (12.1.1) 


We have allowed the fermion fields to be in several irreducible repre- 
sentations of the gauge group labelled by i. There are separate field-strength 
renormalizations Z® and bare masses M® for each representation. The 
covariant derivative is 


D,W = (8, + igo 48,09 
= (ô, + iggX ŽAS), (12.1.2) 
where g, and gp are the bare and renormalized couplings. (With dimen- 


sional regularization we write g,=p?~“/*g, with g dimensionless.) 
Following Lee (1976) we write the bare coupling as 


X 


Jo SZZ (12.1.3) 


so that the coupling of the gauge field to the ghost is Xg,: 
Za, OD*c, = Ž0 0c, + Op X Canl NAE. (12.1.4) 
The field strength tensor is 
Civ oe a Gouv 
= Z3 ° (0 Ao — Adu — GoCabeA ou40») 
= 0,A% — 0,A% — JRX ŽT Caw ALAS. (12.1.5) 


Observe that in accordance with theresults to be proved, the coefficient of 
the gauge-fixing term (ð: A}? is finite, when the renormalized field is used. 
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This gives a gauge-fixing term — (3: A,)*/(2&,.), when expressed in terms of 
the bare field, with 
Čo = 236. (12.1.6) 


The main theorem to be proved is: 


Theorem 1. The renormalizations 
M®, X, Z, Z®, and Z, (12.1.7) 


can be chosen so that Green’s functions of A, Y, W, c, and € are finite. 

To prove this result we will use the Ward identities for BRS invariance. 
These involve a number of composite fields, which we also need to prove 
finite. The counterterms for these composite operators are related to the 
basic counterterms (12.1.7). We will prove: 


Theorem 2. 
Opps(renormalized field)/dA, 


is finite. That is, its Green’s functions with any number of renormalized 
fields are finite. We define the renormalized BRS transformation dp2,/6/, as 
follows: 


(1) Let the BRS transformation ôggs be defined by (2.13.1), (2.13.2) with g 
and č replaced by g, and &, and with the fields replaced by 
unrenormalized fields (i.e., A > Ap, etc). Then the Lagrangian (12.1.1) is 

BRS invariant. 
(2) Define 54, = ĝAZ3 t2 Ž- "2. Then 


OpW = OppsW//OAp = — IJR Xt YC, 
bp Aa = Ôgrs4 t/g = 66,2 + JrCaeC” ASX, 
Ope” = OppsC*/OAg = — 59RX Cave’ = — tgk Xc ac,? (12.1.8) 


1 
One = Oprsc’/OAR m A 


Renormalized Ward identities follow from the unrenormalized ones by 
multiplication by 6A/dA,. The operators appearing in them are finite, 
because of Theorem 2. 

Certain auxiliary operators are useful for reasons which only become 
apparent in proving Theorems 1 and 2. 


O, = ŽO" + PAIX Cares (12.1.9) 
Ry, =(Ž/X —1)0,27/9p + CarchpAs, (12.1.10) 
Č A Č = C plp. (12.1.11) 


The operator 0, is zero by the ghost equations of motion. We will prove: 
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Theorem 3. Green’s functions of 0,, Z,,,and ¢ ^ č with any number of basic 
fields are finite. 


Theorem 4. Green’s functions of #,, with dg, and any number (greater 
than zero) of basic fields are finite, where @ is any basic field. 


Theorem 5. Green’s functions of č ^ č with one or two 6p,q@’s and any 
number (bigger than zero) of basic fields are finite. 


The last few results have no intuitive appeal. They will be needed as part 
of an inductive proof of the important Theorem 1. We will also find it 
convenient to use CPT invariance of the theory (after dimensional 
regularization). Now reversal of one time and one space component is 
equivalent to reversal of components 0, 1, 2, and 3 witha spatial rotation. So 
to obtain the TP part of CPT, we need only consider reversal of the 0 and 1 
components only. Therefore, we define 


—1, ifu=v=O0orl, 
Oe =< +1, ifu=v>z2, 
0, otherwise. 
Let the fields transform under CPT as 
W(x) > yW"(Ox), 
A%(x)— A*(Ox), 
c*(x) > c*(Ox), 
C(x) > (Ox). 


(12.1.12) 


(12.1.13) 


Then the theory is CPT invariant. (We use y-matrices in which y° = 
°T = y°*, and y“t = y,.) Notice that the ghost field, c,, transforms to itself, 
rather than to the antighost field, c*, even though these fields might be 
regarded as complex conjugate fields. 


12.2 Proof of renormalizability 


12.2.1 Preliminaries 


The Ward identities of a gauge theory provide relations between different 
Green’s functions. However, the identities mostly relate Green’s functions 
of elementary fields to Green’s functions containing the composite fields 
listed in (12.1.8)—(12.1.11). However, to prove renormalizability, we actually 
need relations between Green’s functions of elementary fields only. 
Consequently proofs tend to be rather indirect and long. 

The following references contain a representative selection of the proofs 


12.2 Proof of renormalizability 299 


in the literature: °t Hooft (1971a, b), Taylor (1971), Slavnov (1972), °t Hooft 
& Veltman (1972b), Lee & Zinn-Justin (1972), Becchi, Rouet & Stora (1975), 
Lee (1976), Itzykson & Zuber (1980), and Piguet & Rouet (1981). A great 
simplification was introduced by the discovery by Becchi et al. of their 
symmetry. However, the proofs still are mostly rather inexplicit. The proof 
to be given in this section gives all the steps needed to go from the basic 
Ward identities to the relations between the counterterms. The method 
follows that given by Brandt (1976) and Cvitanovic (1977). A point at which 
many proofs became rather inexplicit turns out in this method to be the 
point at which the operator &,, (defined in (12.1.10)) makes its appearance. 
It is an unobvious operator to use, but its use is essential to completing the 
proof that the gauge coupling to matter fields is the same as the self- 
coupling of the gauge field. 

The proof is by induction on the number, N, of loops. We assume that all 
graphs with less than N loops have been successfully renormalized by 
counterterms of the form implied by the Lagrangian (12.1.1). We also 
require that Green’s functions of the composite operators considered in 
Theorems 2 to 5 are also finite up to N—1 loops if the indicated 
counterterms are used. The induction starts with tree graphs, which need no 
counterterms. 

Our strategy is as follows: 


(1) At each order below N loops we have values of the five independent 
renormalizations Z, Z,, Z}, Mo, and X. For each 1PI Green’s function 
with an overall divergence a value of the overall counterterm is hence 
computed at each order less than N. Partition this counterterm into a 
set of counterterms to cancel the overall divergences of the individual 
graphs for the Green’s function. 

(2) Compute N-loop contributions to Z, Z,, Z} My, and X by imposing 
renormalization conditions on certain Green’s functions. The Ward 
identities will be true, but it is not immediate that the many other 1PI 
graphs are finite. This is done at step (3). 

(3) Using these Ward identities show that Theorems 1 to 5 hold for N-loop 
graphs. 


Step (1) is technical but important. Its use is that, in order to say that the 
only divergence of an N-loop graph is the overall divergence, we must have 
subtracted off its subdivergences. However, for individual graphs the 
constraints imposed by gauge invariance do not hold. Consider, for 
example, the two-loop graph, Fig. 12.2.1. Its self-energy subgraph needs a 
counterterm of the form 4,g,,k* — B,k,k,. 
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,r~ 
N 
Fig. 12.2.1. Fig. 12.2.2. 


Similarly the subgraph of Fig. 12.2.2 has a counterterm 4,g„k° — 
B,k,k,. As we will see, the Ward identities imply that the total self- 
energy counterterm is obtained from the term — 4(Z; — 1)(0,A,—0,A,) 
in X. It follows that A, + A, = B, + B,. However, the relation is false for 
the separate graphs, i.e., we have A, # B,, A, £ B3. 

At steps (2) and (3), where we discuss the N-loop counterterms, we then 
know that the only divergences are overall ones. Moreover, we know this 
without having to check on intricate series of cancellations between 
different graphs. But for the purposes of finding the constraints imposed by 
the Ward identities on the N-loop counterterms, it is convenient to consider 
a single overall counterterm for the sum of all N-loop graphs for a given 1PI 
Green’s function. Having obtained these constraints, we decompose the N- 
loop counterterms into individual portions for each graph; this enables us 
to continue the induction at the next order. 

If we use minimal subtraction, the counterterms can be obtained graph- 
by-graph without worrying about the constraints imposed by Ward 
identities. These constraints will be satisfied automatically. For example, in 
a general renormalization prescription the counterterms to the subgraphs 
in Figs. 12.2.1 and 12.2.2 have the form 


A, =g97[a,/(d — 4) +a; ], 
B, = g*[b,/(d — 4) + bi], 
where a; and b; are finite and depend on the renormalization prescription. 


The Ward identities tell us that we can renormalize the divergences by a 
transverse counterterm 


(a, +a,)/(d—4) +a, +a, =(b, + b,)/(d —4) + bi + by. 


Evidently a, +a, =b, +b,, and a, +a, =b; +b,. The first equation 
must always be satisfied, the second must be imposed by choice of re- 
normalization prescription. Minimal subtraction with a] =a, = 
b; =b} =0 always satisfies these equations. 


12.2.2 Choice of counterterms 


We now assume that step (1) has been carried out. The next step is to pick a 
set of 1 PI Green’s functions to fix Z, Z,, Z}, Mo, and X. This is somewhat 
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arbitrary, but our choice will determine the form of the rest of the proof. We 
choose the following set: 


(1) The fermion self-energy has divergences proportional to p and to 1. 
These are cancelled by counterterms in Z, and M,. 

(2) Z is chosen to cancel the p? divergence in the ghost self-energy. 

(3) X is chosen to make the ghost—gluon coupling finite as far as the C pe 
part is concerned. 

(4) Z, is chosen to cancel the part of the divergence of the gluon’s self- 
energy that is proportional to — g,,k* +k,k,. 


Next we will examine the Green’s functions used in Theorems 1 to 5 to 
check for possible divergences at N-loop order. Since all divergences at 
lower order are cancelled, the possible remaining divergences are overall 
divergences of N-loop 1PI Green’s functions. These are Green’s functions 
with either elementary external lines or with insertions of the various 
composite operators we use. The dimension of a 1 PI Green’s function must 
be zero or greater in order that it have a non-negative degree of divergence 
and thus be potentially divergent. The contributions to such a Green’s 
function are (a) N-loop basic graphs, (b) graphs with counterterms to cancel 
subdivergences, (c) an overall N-loop counterterm derived from our 
knowledge of ZZ 2 Z3, Mo, and X. We must prove that the sum of these 
contributions is finite. 


12.2.3 Graphs with external derivatives 


There is a derivative on a ghost line where it exits from an interaction. Thus 
the 1PI graphs of Fig. 12.2.3 have negative degree of divergence even 


->- =J 
Y contributing to ¢0|Tcccc|0> 
-J ->— 


contributing to ¢<0|T cc A A|0> 


CZ... 

4 : : 2 

« contributing to ¢<0|Té Aécc|0> 
©- ->- 


Z ao 
contributing to <0|T2,,4c|0> 


Fig. 12.2.3. Graphs with negative degree of divergence and non-negative dimension. 
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though their dimension is zero. Subdivergences are cancelled and no overall 
counterterms are present, so the corresponding 1PI Green’s functions are 
finite. 
The same argument shows that the ghost self-energy needs no mass 
counterterm, but only a field-strength renormalization Z. 


12.2.4 Graphs finite by equations of motion 


Consider Green’s functions of ©, with basic fields. These are finite. For 
example, 


{0|TO,(xJe(y)|0> = Z0,<0|Te A 


+ gpXc ê co] Tc(x)A™(x)é?(y)|0> 


adc Ayu u 

= — i165 (x — y). (12.2.1) 
The only graphs for the left-hand side with an N-loop 1PI subgraph are of 
the form Fig. 12.2.4(a) and (b). The graph (a) has a ghost self-energy made 
finite by its wave-function counterterm. Graph (b) needs a counterterm in 
©, proportional to ()c,. Such a counterterm is graph (c), which has the N- 
loop contribution to the Ž 0] c, term in (,. Finiteness of (12.2.1) shows that 
this is the correct counterterm. Similarly, the other potentially divergent 
Green’s function of (,, viz., 

<0|TO,c’A°|0> 
is finite. 
-L <- Loe E 


(a) (c) 
Fig. 12.2.4. Overall-divergent O for Green’s functions of ©, 


12.2.5 Gluon self-energy 


We have the Ward identity 
0 = 5<0| Td-A(x)é(y)|0>/dAg 
= (1/€)<0| Td- A%(x)d- A(y)|0> — <0| TOX) y) > 
_loé ð 


<0 T A*(x)A™(y)|0> + 15 (x — y). (12.2.2) 


The second term involves (, because of our definition of ô&4,. In the last 
line we used (12.2.1) on ©,, and remembered that Green’s functions of ô: A 
have the derivatives outside the time-ordering, by definition. 
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mnh ya 


Fig. 12.2.5. Gluon self-energy. 


The only possible divergences in <0| T44(x)A$(y)|0> are in the N-loop 
self-energy graphs (Fig. 12.2.5), and they consist of terms proportional to 
9,yM? or to k,k,. Remember that we used Z, to cancel any divergence 
proportional to g uk? — k,k,. Both the remaining divergences are absent, by 
(12.2.2). Thus no renormalization of the gauge parameter is needed, and no 
gluon mass is needed. The form of (12.2.2) is just as in QED. It implies that 
the gluon self-energy is purely transverse, so that the longitudinal part of 
the propagator is unchanged by higher-order correction. Hence, we can 
write: 


; i ma + 2 
[atxer=olraginayooy = ia eal _ State) 


q? + ie 1 + M(q’) q? 
(12.2.3) 


12.2.6 BRS transformation of Aj. 


Consider the Ward identity 
0 = < 0| TA4(x)e*(y)|0>/d2Q 
= (1/€) ¢0| TAa(x)d-A(y)|0> — (0| TRA x). (12.2.4) 

The first term we have just proved to be finite. The only potentially 
divergent graphs for the second term are shown in Fig. 12.2.6. Equation 
(12.2.4) shows their sum to be finite. 

The c(y)A(z) Green’s function (Fig. 12.2.7) of 6,A is also possibly 
divergent (logarithmically). But we have: 


0 
5 < O| T ôr 4“(x)Z(y)AS(2)|0> 
X 


= (0| TO%x)é(y)As(z)|0 
= ~ 15% (x — y)5 (0| 45(2)|0) 
=0, (12.2.5) 


| x 
| | í -Q -- + r--4-- 
20,C4| -loo DA N a 
N-loop a P 


Fig. 12.2.6. Divergent graphs for right- Fig. 12.2.7. Graphs for 
hand side of (12.2.4). (0|T6,A™(x)e(y)AS(z)} OD. 
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Applying 0/0x" is equivalent to multiplying by k, in momentum space. 
Since the divergence is at most logarithmic, (12.2.5) shows that there is no 
divergence at all. 


12.2.7 Gluon self-interaction 


Now 
0=45<0| T A*(x)A*(y)e(z)|0> /OdAp 
= (0| Td, Ai(x)Ai(y)e(z)|0> 
— (0| TAL (x)bg A (y)e(z)|0> 
— €0| TAL(X)AW(y)dpe(z)|0> 


Yy 


aa a aA aoe 
= finite tgga 0l T44 A 2loy, (12.2.6) 


where we used the previous result. The three-gluon vertex is linearly 
divergent, and we have a counterterm equal to (Z,XŽ-! — 1) times the 
lowest-order vertex. There is no possible left-over divergence that satisfies 
(12.2.6). 
Similarly 
ô 
oz” 
Here the only potentially divergent N-loop 1PI subgraphs are as in 
Fig. 12.2.8. We have just seen that the divergences in Fig. 12.2.8(b) are 
cancelled by the counterterm for the triple gluon coupling. Since the 
divergence in graph (a) is logarithmic, (12.2.7) proves that it is exactly 
cancelled by the counterterm in the four-gluon interaction. 


A + crossings 


(a) (b) 
Fig. 12.2.8. Potentially divergent graphs for four-point Green’s function of gluon. 


<0|TA,(w)A (x)A,(y)A%(z)|0 = finite. (12.2.7) 


12.2.8 dpc 


The only Green’s function of 6,c that could be divergent is 
<0| Tdgc El. (12.2.8) 
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But 
0 = 6¢0| Tc%(x)e*(y)e(z)|0 
= (0| Tdgc(x)e*(y)e(z)|0> 
— (1/€)<0| Tc%(x)d- A*(y)e*(z)|0> 
+ (1/€) < O| Tc(x)é*(y)d- A(z)|0D, (12.2.9) 
so finiteness of (12.2.8) follows from finiteness of the ghost-gluon vertex, 
which is a renormalization condition. 


12.2.9 Quark—gluon interaction, dw, 5; introduction of Bay 
Consider the Ward identity 


0 = 5 0| TH(xM(y)Ez)|0/52q 
= <0] Tdgwc|0> <0 TWdgWe|0> + <0| Tyy(d- A/é)|0>. 
Finiteness of the last term follows if we can prove dg = — ig, X tye, finite. 
(Note that 5, is related to ôk by the CPT transformation of (12.1.13).) 
Now X was defined by requiring the ghost-gluon vertex to be finite. An 
explicit proof, which we now give, brings in all the remaining operators 
listed in (12.1.9)—-(12.1.11), and in particular @,,.. 

In Fig. 12.2.9 we list all Green’s functions still to be proved finite. Observe 
that 6,c=0-A/é, so that it is finite if the gauge field is. Also Green’s 
functions of #,, or ¢ ^ č with dg and any number greater than zero of 
basic fields are finite, if the Green’s functions of Figs. 12.2.7 to 12.2.9 are 
finite. (Here 6,9 is the BRS variation of any elementary field ¢.) This proves 
Theorems 4 and 5, so it remains to prove finiteness of the Green’s functions 
illustrated in Fig. 12.2.9. 

The idea behind the proof is to examine the right-most vertex on the 
ghost line in Fig 12.2.9(a). This comes from the following term in the 
interaction Lagrangian: 


| Cabe Irl") AS d*x. (12.2.10) 


as <- M 
(a) < , logarithmically divergent, contributing to <0|Tôky yclo» 


(b) Lo. logarithmically divergent, contributing to ¢0|T@,,c|0> 


Fig. 12.2.9. Green’s functions not yet proved finite. 
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The derivative is on the line entering the vertex graph. By integrating by 
parts, we see that the vertex equals 


— IRCabc fatxterechae + €c"d- A‘). (12.2.11) 


In the first term the derivative is outside the loop-momentum integrals, so 
the degree of divergence is reduced by one. Factoring out the field 0,,c’ on 
the external line gives the basic vertex C œA, in the operator #,,. The 
second term in (12.2.11) contains 0-A which we shall relate to something 
else by use of Ward identities. 

We first prove finiteness of #,,, by formalizing the argument leading to 
(12.2.11). This is done in an unobvious way: 


11, <0] Tdgc(x)e"(y)e(z)|0> 
= (1/X){Z 0, <0| T ôr] 
+ GpX Cede <0| Tdgc*e(Af0%c%)|0> } 
+ gp LO|T RC (Cae: A*)|0 


ô =b 
= Jrz <0] Tôrc"Be(z)|0> 
= iJrCaa 0| Te“(x)e*(y)|0> d(x — z) 
+49p O| Tóg irla) 
ô : 
z Irz 6O| Tôgc"T BY(z)|0> 
= finite — $9, (0|Td,c*d- AE ^ c)°|0> 
= nO T dgc*e? B4(z)|0>. (12.2.12) 
Z 


The next-to-last line follows by the antighost equation of motion. The last 
line uses a Ward identity plus the nilpotence property 


ôrl ôg] =0. (12.2.13) 
Now the second term on the last line is finite (by Theorem 5), and the left- 
hand side of (12.2.12) is finite. So the last term on the right is finite. The only 


possible uncancelled divergence is of the form of Fig. 12.2.10, from which 
finiteness of 2,, follows. 


Vv 


7 


7 


Fig. 12.2.10. Only possibly divergent graph for (12.2.12). 
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Finiteness of ôky follow by the same manipulations applied to 
<0|Td,Wwel0>: 
O1.<0| Tégy(x)p(y)e*(z)|0> 
= gx (0|Te*p(x)h(y)|0>5(x — z) 
—F9R<O| TORWSgW(E A C)|O> (12.2.14) 


- gr- 0|T Su W(x (V)-B5(2)[0). 


12.3 More general theories 


In the last section we proved renormalizable the simplest gauge theories: 
that is, those with a gauge group with a single component and with fermion 
matter fields. More general cases can easily be treated by the same methods. 
The general result is that renormalizations are needed for each independent 
coupling in the basic Lagrangian and for the field strength for each 
irreducible field multiplet. Let us examine some specific generalizations. 


12.3.1 Bigger gauge group 


The gauge group can in general be a product of several components: 
G =| ]|?_,G,@U(1)”. Here each G, is a simple compact non-abelian group 
(like SU(N)), and there are v abelian U(1)’s. For each G, and for each U(1) 
factor there is an independent coupling g,. When we perform gauge fixing 
there will be a multiplet of ghost fields for each component of the gauge 
group. The proof of renormalizability will need no change. It will relate the 
renormalizations of the couplings within each multiplet. Thus for each of 
the n + v components of the gauge group there are renormalization factors 
X,,Z3;, and Z,, for the coupling, the gauge field and the ghost field. In 
addition there are the usual renormalizations for the matter fields. 

There are some special features of the abelian case which we will treat in 
Section 12.9. 


12.3.2 Scalar matter 
The part of the Lagrangian for a scalar field coupled to gauge fields is 
D" D, — V(¢", o). (12.3.1) 
Here Vis a function of ¢ and ¢' that is invariant under the gauge group, and 


D, is the usual covariant derivative. 
"For example, consider an SU(2) gauge theory in which h is a doublet 
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under the gauge group. Then the most general renormalizable form of V is 
V= mob + LU g'd)?, (12.3.2) 

while 

D,,o' Do =| p|? —ig A" Gt rI p +g ACA Ot ttt 
=|0,6|? —igA2gtt*0" + 497A? Oo. (12.3.3) 
Wave-function and mass renormalization are used to make the propagator 
finite, and the ¢'0@A coupling is proved finite as for a fermion. The self- 
coupling of the @-field is made finite by a renormalization of the Ad‘ o? 
term, which is the only four-point coupling invariant under global SU(2) 
transformations. One further Green’s function, viz., <0|T A A¢‘¢|0> has a 
potential logarithmic divergence. It 1s proved finite by the Ward identity: 
0=65<0|TéZAG'h|0 
=(1/E)<0|Td-AAd* $|0> + ¢0| TcdA G' pO 

+ <0|TCAdg'd|0> + (0| TEA G*5G|0>. (12.3.4) 


12.3.3 Spontaneous symmetry breaking 
Consider the abelian Higgs model as an easy example. Its basic Lagrangian 
1S 
L asie = — F2, + (ô, — ieA )G'(H + ied") — 32°" o -S 7/227)’. 
(12.3.5) 
The symmetry is spontaneously broken with <0|ġ|0> =f /(4 ,/ 2), in lowest 
order. So we write ¢ =[f/A+(¢, + if.) ]/ V2: 
L pasie = — ZF jy + (ef 7/207)A* + (ef /A)A, p + 5067 + 3063 —f°G7/4 
— A(t + G3)" /16 — Af plp + 3)/4 
+ eA.p, 0", + eA lp] + pi + 26 b,/A)/2. (12.3.6) 


If we quantize with the simple gauge fixing term 


basic 


es l A 2 

LaF -z A)’, (12.3.7) 
then renormalization is covered by the discussion of Section 9.2, where we 
treated spontaneous symmetry breaking in a non-gauge theory. We first 
renormalize in the unbroken theory (i.e., with f* + —f?). We need wave- 
function renormalizations Z, and Z,, coupling renormalization A? > A Y, 
and mass renormalization (which is effectively f?>f7?Z,,). Since the 
Faddeev—Popov ghost is a free field, the gauge-coupling renormalization is 
X = 1. After spontaneous symmetry breaking the same renormalizations 
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make the Green’s functions finite. Unfortunately the 4,0”, term in the free 
Lagrangian makes the Feynman rules rather messy, so it is convenient to 
use another gauge condition. We will discuss this in Section 12.5. 


12.4 Gauge dependence of counterterms 


To quantize and renormalize a gauge theory, we choose to fix the gauge in a 
particular way. It is important to show that physical quantities are 
independent of this choice. On the other hand, Green’s functions of the 
elementary fields can certainly be gauge dependent. Since it is the Green’s 
functions with which we work when renormalizing the theory, we must 
understand their gauge dependence and its effect on the renormalizations. 

One important class of physical quantities which we will treat is the set of 
Green’s functions of gauge-invariant operators. Also important is the S- 
matrix for physical states. In a spontaneously broken theory there are 
particle states that couple to the elementary fields of the theory. Some such 
states are physical, and we must prove that their S-matrix is gauge 
independent. But other states, like the Faddeev—Popov ghost, are unphysi- 
cal. We will not attempt a complete treatment here. 

When a gauge symmetry is unbroken there may even be no states 
coupled to the elementary fields. Indeed, it is commonly expected that in 
QCD colored states are confined. Certainly, within perturbation theory 
there are severe infra-red problems in obtaining the S-matrix for quarks and 
gluons. However, S-matrix elements of hadrons can be obtained from 
gauge-invariant Green’s functions. Consider, for example: 


COL Ti5.(W) i500) /5.()5,(2)|0>, (12.4.1) 


where j$, 1s the axial isospin current. Application of the LSZ formalism will 
give the S-matrix for nx > nr scattering. Similarly the gauge-invariant field 
Wi rêi is an interpolating field for baryons. Here y, is a quark field and i 
its color label. 

There are two sorts of gauge-dependence that we will consider. The first is 
where we change the gauge-fixing function F, to a different function of the 
fields. A specific application of our general results for this case will be given 
in Section 12.5 for the R,-gauge. The second type of gauge-dependence is a 
variation of the parameter č. Although variation of č by a factor À is 
equivalent to variation of F, by a factor 47 '/*, there are a number of special 
simplifications that will lead us to treat this second case first. 

The use of BRS invariance will make our computation of gauge 
dependence very simple. 
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12.4.1 Change of č 


The Green’s functions and the renormalization factors in general depend on 
č. We can use the action principle to compute the dependence on ¢ of a 
Green’s function <0|TX|0>. Here X denotes any product of local fields 
with no explicit dependence on €. Then 

ÔL 
ôE 


Y aL 
=i} gg [4*v<0] 7: FpO):X10 


$:40|7X|0) =i faty<o]r: (y):X|0> 


+ zia [A*¥<0LTF20):X10> (12.4.2) 


In the first term we use Y to denote any of the independent renormalization 
factors. As usual, we use the symbol: : to indicate subtraction of the 
vacuum expectation value. Next we use the form of the BRS transfor- 
mations to write 
(1/€)F2 = 6(6,F,) — ĉ,ôF (12.4.3) 
and substitute for FŽ in (12.4.2). We can use the Ward identity 
6<0|TcF X|0> =0 


and the equation of motion 


<0|T2,6F,(y)X|0> = —i X <0] Té,6X/5é,(y)|0> 
to give i 


- o SYS 0 ae. 
z 0lTX]0> =i ù zg |do r: zy :X10> 


-5 d+y0|T:2,F,:5X|0) + (N,/28)<0|TX|0). 


(12.4.4) 


Here N, is the number of factors of the ghost field in X. 

Suppose first that we choose to use the same counterterms for all values 
of č as at some particular value, say č =0. Then Green’s functions of 
elementary fields will not be finite except at this special value. But if we take 
a Green’s function of gauge-invariant operators then (12.4.4) indicates that 
the Green’s function is gauge independent, for such operators contain no 
ghosts. 
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Next consider a Green’s function of elementary fields: 
N 
<0|TX|0> =<OlT]] b(xt)|0>. (12.4.5) 
i=1 


The renormalizations in Z are adjusted to keep Green’s functions finite for 
any value of é. Then in (12.4.4) we find the €-dependence of the counterterms 
by requiring the 


OY OL 
Da OY 
term to be the sum of the counterterms needed to cancel the divergences of 
—1¢,F:/(2¢). 
We consider the ultra-violet divergences of 
<0|T:¢,F,(y):6X |0>. (12.4.6) 


Divergences in (12.4.6) occur either when y coincides with some set of 
interaction vertices or when it coincides with one of the x,’s, which are the 
positions of fields in X. We can renormalize Green’s functions of :ĉ,F,: with 
elementary fields by adding to it an operator :D(y):. Since we choose X to be 
a product of elementary fields, the only remaining divergences in (12.4.6) are 
when y coincides with the position x of a BRS varied field 6¢@(x) in 6X. To 
renormalize this divergence we need counterterms to (12.4.6) of the form 


N 
<0|T :D(y):6X|0> + » ôy — x) O| TX |0) |si- iB oxo 
i=1 


(12.4.7) 


The counterterm when y coincides with d¢(x) has been written as 
— iE,(x)6(y — x), there are possibly derivatives of the d-function, and the 
normalization factor — i will be convenient later. 

By use of Ward identities and equations of motion, we can write (12.4.7) 
as 


~ <0|T :5D(y):X|0) — F <0|T :E,(y)dS/5o(y): X10). (12.4.8) 
$ 


This is the most general form of counterterm needed to keep the Green’s 
function finite as č varies. It is even the correct form for the change in 
counterterms caused by a change in the form of F, as we will see in Section 
12.4.4. To proceed any further we must do power-counting to determine 
which counterterms actually occur. 

We now restrict attention to the gauge condition F, = 0-A,. The only 
divergent elementary Green’s functions of:¢,F,:are illustrated by their 
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a SY 


Fig. 12.4.1. Lowest-order graph for Fig. 12.4.2. Lowest-order graph for 
<0|Tc :¢,F,:|0>. <0|Tc A :2,E,:|0) 


lowest-order cases in Figs. 12.4.1 and 12.4.2. The divergence of the first 
graph has a factor of the derivative at the ghost interaction. The 
counterterm for Fig. 12.4.2 is proportional to (0,,¢,)A 4 = derivative — ¢,F,. 
Since we integrate over all y, the only contribution to the counterterm 
operator D is proportional to ¢,F.,. 


N 
N 
N 
Si E 
Six 
Ča F C.F 


Fig. 12.4.3. Lowest-order graphs for :¢,F,: with BRS variation of a field. 


CaF a 


The divergence with a BRS-varied field 6¢ is logarithmic and pro- 
portional to ¢, as in the graphs of Fig. 12.4.3. So we have E, oc ġ. This 
corresponds to a variation with č of the wave-function renormalizations 
with gọ and M, fixed. For example, 


ôZ, ðZ  1ôlnZ, 
OE oZ, 2 0k 


A complication arises since we know € is not renormalized: 


6Z,0L _1énZ, 


——? (WdS/d + PSJ) (12.4.9) 


Sain ae a 2 
ôč ôZ, 2 ôé e + Fs (25) 
= on aes goS/5A + F2/22). (12.4.10) 


Putting all this work together, we find 


0 l ôln Z, e p 
5e60[TX|0> = -(1 a ) faty<olrse,F,0:6x|09 


OlInZ, N,OlnZ, 
7 + aE 
ala 2 ô 


ôln(ŽZ;?) č 
n| eee + als (12.4.11) 


~ <O|TX]O>4N 


This is just the counterterm structure we need. The operator ¢,F, is 
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multiplicatively renormalized by a factor 1 + čôln Z,/ôč. (In an abelian 
theory no renormalization is needed since c, is a free field; so Z, is 
independent of ¢.) The other terms (where N, is the number of external ¢ 
fields of X) come from renormalizations of 1PI graphs including both ĉ,F 
and a ôg. 

Note that dln Z,/0& occurs in two places. This comes from assuming that 
the gauge-fixing term is — F?/(2¢)= —d-A?/(2é) with no extra re- 
normalization factor. We proved this from the Ward identity: 

0=6<0|T2C,(x)F,(y)|0> 
= (1/€)<0| TF,(x)F,(y)|0> + ¢0| Te, 6F,|0> 
= (1/€){0| TF,(x)F,(y)|0> — i5(x — y)5gp5 (12.4.12) 


where the last line follows from the ghost equation of motion. Since 
F,=0-A,, its Green’s functions are finite, and so € is finite. 

We can use(12.4.11) to prove gauge invariance of the S-matrix by picking 
out the residue when we go to the particle pole for each external line. 

If we use the correct, ¢-dependent counterterms in Z, then a gauge- 
invariant operator like 


(G4)? = (0,A% — 8,45 + JoZ 3” Cay, A AS)? (12.4.13) 
is actually é-dependent; from (12.4.11) we see that is proportional to Z;'. 
But the bare operator 
(Gio) = (ô A voy z OA Ou T JoCabcA (oyu A (oy) 


is gauge independent. Both operators have UV divergences, so that they 
must be renormalized. 


a 


12.4.2 Change of F, 


Let us assume that F, depends on a parameter x. This allows us to 
interpolate between two different gauge-fixing conditions. We follow the 
same method as for the €-dependence. The change in the Lagrangian is 


OL 0YOL 1 ôF, _ OF, 
=) —— --F,— —¢,6 
ôk Vex OY € ök OK 


YOL OF 
Se a aE 12.4.14 
2 ak BY (3 =) ia. 
So 
0 EON a4 OLY) 
, _ OF, 
+i fatyco|Tie Ax :6X|0>. (12.4.15) 
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The counterterms for the second term have exactly the same form as we met 
in the €-dependence of a Green’s function, i.e., the form (12.4.8). Until the 
form of F, is specified we can make no further statement about the 
necessary counterterms. We will examine an example of this in Section 12.5. 

In the most general case we can take a Green’s function of gauge- 
invariant operators and omit any x-dependence of the counterterms in Y. 
Then the Green’s functions are gauge-independent: ô< 0|TX|0>/ðk =0. 


12.5 R.-gauge 


To eliminate the mixing between the gauge field and the scalar field in 
spontaneously broken gauge theories one can use the R.-gauge devised by ’t 
Hooft (1971b), Fujikawa, Lee & Sanda (1972), and Yao (1973). Let us use 
the Higgs model of (12.3.5) as an example. The R.-gauge is defined by using 
the gauge-fixing Lagrangian 

Z = — (1/2E)(0-A — KEmg,)’. (12.5.1) 
We use the parameter x to interpolate between the R,-gauge, where x = 1, 
and the gauges we used in Section 12.2, where x = 0. Also, m= ef /A. 


An interacting Faddeev—Popov ghost is needed, even in an abelian 
theory (which is the only case we will explicitly treat): 


L yo = 0,0" — KEM? — eEmKd , Cc. (12.5.2) 
The BRS variations of the fields are: 
0A, = 0,¢, 
0p, = — (m + eg,)c, 
0p, =eg 6, (12.5.3) 
dc = (0: A — Cxme,)/¢ = F/¢, 
dc = 0. 


Observe first of all that the extra couplings relative to the gauge x = Oare 
all super-renormalizable, so that the renormalizations of the dimensionless 
couplings can remain unchanged. These are X (which equals unity in an 
abelian theory), Y, Z,, and Z,. To determine what further renormalizations 
are needed, we use the method of Section 12.4.2, with 0F/dx = — čmo.. 

Thus we need the counterterms to 


= igm [aty<0]T:2630):3x10>. (12.5.4) 


The only divergent graphs contain ĉġ,, the BRS variation 6¢,, and no 
other external lines. They have the form of Fig. 12.5.1, which is logarithmi- 
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Fig. 12.5.1. Divergent graphs for (12.5.4). 


cally divergent. Notice that interactions on the ghost line would make the 
graph convergent. The counterterm to (12.5.4) therefore has the form 


C [ay 0179/80, (10) 


= ic [a*y<o T:6S/0@,(y):X|0>. (12.5.5) 


This counterterm is generated by a shift of ¢@, in the Lagrangian: 


Lip, +m, e, pġ, 4,¢,€) > L(b, + Bm/e, pz, A,c, ©) (12.5.6) 
with | 


mog (12.5.7) 


As far as renormalization is concerned, the effect of using the new gauge 
condition is to generate an ultra-violet divergence in the vacuum expec- 
tation value of the scalar field. Although this might appear strange, it is 
permitted to happen since the field is not gauge invariant. 
Gauge-invariant Green’s functions are unchanged, of course, as is the S- 
matrix. The ghost field, ¢,, and the longitudinal part of A, all couple to 
unphysical states. In this abelian theory, F „is a gauge-invariant field which 
couples to the transverse part of the gluon, while 
ZmZ2P'b = ZZ 1(p, + Bm/e)’ + $3] 
= Z,,Z,(Bm/e)* + ¢,[2(Bm/e)Z,,Z,]+°°° 
is gauge invariant and renormalized, and couples to the @¢,-particle. 
The Lagrangian can be written out in terms of the fields, to exhibit the 
interactions. It is rather fearsome-looking. We set x = 1 and find 
L = —42,;F 7, — (1/2€)0-A* + 3m*B*Z,A* + 0c 
— m° Béčc + A,0"p,m(Z,B — 1) 
+42,0? + 063) — ip?m 2 e7? (3B? Y— Z,,) 
—202m°[¢ + 4/°e (B? Y- Z,,)] 
— T64 Yli + 3)? — 42?me" 'BYp (7 + $3) 
— ij?*mee” > B(B?Y— Z,)d, 
+ eZ, Ay 0"b, +4 ZAI + $3) 
+ emBZ,o,A; — čmo ,éc. (12.5.8) 
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The same methods can be applied to a non-abelian theory with extra 
complications in the structure of the vertices — see Fujikawa, Lee & Sanda 
(1972). 


12.6 Renormalization of gauge-invariant operators 


In applications such as to deep-inelastic scattering we need the operator 
product expansion of gauge-invariant operators. It is natural to assume 
that only gauge-invariant operators appear in the expansion. Moreover, in 
using the operator product expansion, we need the anomalous dimensions 
of the operators. To compute these, we need the expression for the 
renormalized operators in terms of the bare operators. Again, it is natural to 
assume that only gauge-invariant operators are needed. 

Both assumptions, taken literally, are false (Dixon & Taylor (1974), and 
Kluberg-Stern & Zuber (1975)). We will first treat the renormalization 
problem, where certain gauge-variant operators mix with gauge-invariant 
operators. As we will see, the gauge-variant operators vanish in physical 
matrix elements (Joglekar & Lee (1976), Joglekar (1977a and b)). We will 
simplify many parts of the proof by working in coordinate space. Then we 
will apply the same methods to find the operators that appear in the 
Operator product expansion. 

In the case of an ordinary global symmetry (like Lorentz invariance or 
isospin), an operator that is invariant under the symmetry mixes only with 
invariant operators. In the case of a gauge theory, we break the invariance 
of the action by gauge-fixing, leaving only a BRS invariance. But a gauge- 
invariant operator even mixes with operators that are not BRS invariant. 
How can this be? | 

To answer this question, we consider an operator ( that is invariant 
under some given transformation. Let C be the sum of its counterterms, and 
assume the action is invariant. Then the Green’s functions are invariant 
under the transformation, so 


0=6<¢0|TX(0+C)|0» 
=<¢0/T6X(0+C)|0>+ <0ITX 6(0 + C)|O> 
= <0|T6X(O+ C0) + <0|TX ôC|0>. (12.6.1) 
Now C is defined so that Green’s functions of © + C with elementary fields 
are finite. For an ordinary symmetry, the variation of an elementary field is 
again an elementary field, so the first term on the right of (12.6.1) is finite. 
Hence the other term, an arbitrary Green’s function of ôC, is finite. If we use 


minimal subtraction, the counterterms are powers of 1/e,so ôC is finite only 
if it is zero. Thus counterterms to an invariant operator are invariant. 
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But when 6 represents a BRS transformation the variation of an 
elementary field is composite. So finiteness of <0| TX (0 + C)|0> does not 
follow from finiteness of the elementary Green’s functions of O + C. Hence 
the counterterms to a gauge-invariant operator need not be BRS invariant, 
let alone gauge invariant. 

This argument also shows us how to handle the problem. Let X be a pro- 
duct @¢,(x,)...dy(xy) of local fields. Then the only divergences of 
<0|T5X (O(y) +C(y))|O> that lack a counterterm are when y coincides with 
the position of a BRS-varied operator 6¢,(x;) in 6X. Hence, provided y is 
not equal to any x,, ¢0|/T6 X(O(y) + C(y))[0> is finite. From (12.6.1) we then 
see that <0|T X 6C|0) is then finite. But if we use minimal subtraction this 
means that it is zero. Hence an arbitrary Green’s function ¢0|T X 6C|0> of 
ôC with elementary fields is zero. Thus, as an operator, dgp,C = 0: the 
counterterms to a gauge-invariant operator are BRS invariant. 

One can try verifying this theorem by explicit calculations (e.g. 
Kluberg-Stern & Zuber (1975)). These apparently contradict the theorem. 
However, the non-invariant counterterms must correspond to operators 
that vanish by the equations of motion. As usual, the treatment of 
derivatives of fields in covariant perturbation theory implicitly generates 
commutator terms. After Fourier transformation into momentum space 
the non-invariant operators are not manifestly zero. 

What we are actually interested in are gauge-invariant operators rather 
than merely BRS-invariant operators, for it is only the gauge-invariant 
operators that have physical significance independently of the method of 
gauge-fixing. So we must show how it is that the gauge-variant operators 
that mix with © do not enter into physical quantities. 

Let ©,,...,0, be the set of gauge-invariant operators that mix with 0. 
These are the operators which have the same transformations under global 
symmetries (e.g. Lorentz, isospin) as ©, and whose dimension is at most that 
of ©. Choose this set so that it is linearly independent (after use of the 
equations of motion). There are three other classes of operators that mix 
with 0: 


Class A These operators, A,, are the BRS variation of some operator: 
A, = 0A; 

Class B These operators, B; vanish by the equations of motion. 

Class C Any other operators that mix with 0, and that are not linear 
combinations of A,’s, Bs, and @,’s. 


The nilpotence of BRS transformations ensures that 6A, = 0 up to terms 
vanishing by the equations of motion. 
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The most obvious classes of BRS-invariant operators are the gauge- 
invariant operators 0; and the BRS transformation of operators 6A,. The 
first result to prove is that there are no others; 1.e., there are no operators of 
class C. The second result is that the renormalization matrix expressing the 
renormalized operators [0], [A], and [B] in terms of unrenormalized 
operators is triangular: 

[O] [Zee Zoa Zos\ |0 
[A]]=| 0 Zia Zap] [A[. (12.6.2) 
[B] 0 0 Zz] \B 


It is easy to prove this second result. The only operators that can be 
counterterms to a class B operator must themselves vanish by the equations 
of motion, i.e., they are of class B. A Green’s function of a class A operator 
6A, can be written as 

<0| T5A,{y)X|0> = — (0|TA,6X|0). (12.6.3) 
Thus if the counterterm to A, is C(A,) then 
<0| T[A; + C(A)]5X|0> 
is finite if y equals none of the x,’s. Hence the counterterm to 6A, is 
5C(A,), modulo terms vanishing by the equations of motion. That is, the 
counterterms to A, = 6A, are class A and class B only. 

The proof that there are no class C operators is somewhat complicated. It 
is essentially a mathematical exercise in homology theory. We refer the 
reader to Joglekar (1977a, b) and Joglekar & Lee (1976) for proofs. 

The importance of these results is as follows: both class A and class B 
operators vanish in physical matrix elements. For class B this is because of 
the equations of motion. For class A we obtain the matrix element by the 


LSZ reduction formula from a Green’s function ¢0|TA;X|0>. But there is a 
Ward identity 


<0|T6A4,;X|0> = — ¢0|T4,6X|0D. (12.6.4) 


Whenever X is gauge invariant we get zero. But we may also use elementary 
field operators in X. A term in 6X with a varied field operator 6¢(x) has no 
physical particle pole for this line. This can be seen by examining the 
possible Feynman graphs, and observing that ô contains a ghost field. 

If the renormalization matrix in (12.6.2) were not triangular then 
renormalized operators of classes A and B would be non-zero on-shell, even 
though the unrenormalized operators are zero. The triangularity ensures 
that for physical matrix elements we have 


[0] = Zo0. (12.6.5) 


Thus we can disregard the non-invariant operators. 
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12.6.1 Caveat 


In practical calculations with Feynman graphs one must beware of taking 
(12.6.5) too glibly. Consider the calculation of [G32] in QCD. The only 
gauge-invariant operator to mix with it is Ww, so we can write (12.6.5) as 


bs . (7 i 2 ( -a (12.6.6) 
[yy] 0 222/\ wy 
The Z,,-coefficient is zero because wy has lower dimension than G?. 


WY 4 


Fig. 12.6.1. One-loop graph for ¢0|TAA[Gi,]|O>. 


We compute the one-loop term in Z,, from the graph of Fig. 12.6.1. 
There are various tensor structures for the counterterms. The G$; 
counterterm is proportional to 


JuyP*(P +4) — (Pp + 4) P- 
Tensors that vanish on-shell are, for example, 


IP? — PPs  Gy(P + 4) —(p + 4), (Pp + a), 

To simplify the calculations it is tempting to set q =0. But then these 
three structures are equal and it is not possible to separate the three 
coefficients. If one sets p? =0=(p+q)* so that p and q are on-shell, 
and then multiplies the graph by polarization vectors “e” satisfying £: p = 
0 = «’-(p +q), then one has taken a physical matrix element and only the 
G** counterterm survives. 

The calculations of Kluberg-Stern & Zuber (1975) are at zero momen- 
tum, and they have to go to some effort to overcome the above problems. 


12.7 Renormalization-group equation 


Renormalization-group equations are derived in gauge theories just 
as in any other theory. One feature that is easy to overlook when making 
calculations is the variation of the gauge-fixing parameter under a 
renormalization group transformation. We consider the theory (12.1.1) and 
let Gy. n,n, be the renormalized Green’s function with N, external gluons, 
N; fermions, N; antifermions, N, ghosts, and N, antighosts. The cor- 
responding bare Green’s function is 


(0) — JN.FNe Ng/2 
GN.,Ni, Ne = Z ‘Z3 Z3 GN., Ne, Ng’ 
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and it is RG invariant: 


d 
0 = yo GO 
Maa 
3 pô yâ 
2 L (0) 2.7.1 
CL MM - 865) vere 


Here $, yy and 6 are obtained by requiring gọ, Mo, and čo = Z;¢ to be RG 
invariant. 
In the minimal subtraction scheme we have 


Jo = pe“? 1+ È 2 a(g) (4 — ar | 


Zy=it 3 b,(g)/(4 — d)", (12.7.2) 


Z,=1+ }, Ci n(g, $4 — d)". 
n=1 
Here Z, stands for Z,, Z}, or Z. The renormalization-group coefficients are 


B = (d/2 — 2)go/(g90/09) 
=(d/2 — 2)g + 1ga, /ôg, 
y u= bôln Zy/ðg = — 39gðb,/0g, (12.7.3) 
y; = Pô ln Z;/ôg = — 390c; 1/09, 
ô = Y3 = — 3 90C3 1/09. 
Similar formulae hold in any other subtraction scheme. 
Observe that, by the results of Section 12.4, g, and Z,, are independent of 
č so that $ and y,, are independent of €. But Z,, Z, and Z depend on €, so 
that y,, y3, Y and 6 also depend on €. As a special case, in an abelian theory, 
Jo =u? "7gZ;'!*, so that Z}, y, and 6 are independent of é. 
The renormalization-group equations for renormalized Green’s func- 
tions are then 


d : 
ea 


T 


= (at ha? Mam °° 3% St Na+ Nm +i iN 6 (12.7.4) 


Since the anomalous dimensions are é-dependent, the solutions of the RG 
equation are a little complicated. The most convenient gauge to use is the 
Landau gauge č = 0, for then € does not vary when a renormalization group 
transformation is made. Tarasov Vladimirov & Zharkov (1980) have 
computed £$ to three-loop order in this theory. 
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12.8 Operator-product expansion 


Consider the Green’s function 


M(y3X1)---) Xy) = (0 TH) (O)X 0). (12.8.1) 
where j is a renormalized gauge-invariant operator, and X is a product of 
local fields at points x,,...,x,. We have the Wilson expansion 

M(y3X1,---.Xy) ~ 2 C{y)<0| TLO,(0)]X|0>. (12.8.2) 


The sum is over all renormalized operators of the appropriate dimension. 
This expansion is proved in a gauge theory the same way as in any other 
theory. 

In the application to deep-inelastic scattering in Chapter 13, we will take 
X to be a product of two gauge-invariant operators and use the LSZ 
reduction formula to obtain a matrix element: 


W=<P|Tj(y)j0)|P> _ 
=) C;< P|[O(0)]|P>- (12.8.3) 


We would like to show that only gauge-invariant operators need be 
included in (12.8.3). Now the proof of the operator-product expansion (in 
Chapter 10) treats the right-hand side of the expansion in the same way as 
renormalization counterterms. So the method we applied to the re- 
normalization of gauge-invariant operators also applies to the Wilson 
expansion. The operators (, are either gauge invariant, are the BRS 
variation of something, or vanish by the equations of motion. Then if we 
keep y #0 in (12.8.3) (as is the case in applications) we only need gauge- 
invariant operators. 
The renormalized operator j has, according to Section 12.6, the form 


j = ja + OpA T B, (12.8.4) 


where j,, is gauge invariant and B vanishes by the equation of motion. 
Hence if we take a matrix element of j(y)j(0), like (12.8.3), (or if we take a 
gauge-invariant Green’s function of j(y)j(0)), then we can drop the 4 and B 
terms, so that 


«P| TIOJ P> =< P| Tigh) icgt)|P >. (12.8.5) 

We now follow our proof in Section 12.6, starting with the Ward identity 
0 = Spas O| Ti(y)i()X [0> 

= (0| Ti(y)j(0)6.X |0). (12.8.6) 


Here we assumed that the positions y,, x,,, and 0 are all distinct, so that 


iw? 


ôj=ôB=0 
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by the equations of motion. But if y->0, then none of the x,,’s are at the 
origin, so that: 


<O Ti(y)j(0)6X|0> ~ > Cy) <0] TLO(0) ]6X|0> 
= — YCy)<0|T[50(0)]X|0>. (12.8.7) 


But the left-hand side is zero, by (12.8.6), while the right-hand side is its 
leading behavior as y— 0 and is therefore also zero. If an arbitrary Green’s 
function of an operator is zero, then the operator itself is zero, 1.e., 


YC; [60,] =0. (12.8.8) 


Hence the operators needed in the expansion are those that are BRS 
invariant. As in Section 12.6, this means they are either gauge invariant or 
of classes A and B. In physical matrix elements like (12.8.3) we can therefore 
restrict the operators to be gauge invariant. 

The practical use of this result is to compute the Wilson coefficients by 
taking the state | P to be a state of one on-shell quark or gluon. The infra- 
red divergences are regulated by going to d > 4. The Wilson coefficients and 
the gauge-invariant renormalization counterterms of the 0, can be 
unambiguously obtained provided care is taken to separate the IR from the 
UV divergences (both of which appear as poles at d = 4). 


12.9 Abelian theories: with and without photon mass 


Consider QED with a possible mass term for the photon. The Lagrangian 
can be expressed in terms of unrenormalized or renormalized fields: 


faye S a(F + zm (AP) + Wo(iD — Myo 


= —1Z,F?,+4m3Z,42+Z,MiD—- MW. (129.1) 


Here we have a single vector field 4, and we let 


fedac A (12.9.2) 
Dw =(0, - ieyAY 
= (ð, —iegZ'!24 Y. (12.9.3) 


We will call A, the photon. Without a mass term for the photon, the 
Lagrangian (12.9.1) is invariant under the gauge transformation 
A At 0,0, 
Y > wexp(ie,Z3/7a), (12.9.4) 
ý > y exp(— ieoZ3” w). 
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With a mass term the Lagrangian is not invariant, but the theory can be 
consistently treated, as we will see. 

In contrast, a non-abelian theory with a gluon mass term is not 
consistent. For example, there are unphysical states — the Faddeev—Popov 
ghosts — which violate the spin-statistics theorem, and the negative metric 
gluon states. In the massless theory these cancel in sums over intermediate 
states, but in a massive non-abelian theory they do not cancel. These 
problems do not occur in the abelian theory with a massive photon. 


12.9.1 BRS treatment of massive photon 


Even with a mass term for the photon, the Lagrangian is BRS invariant, if 
we use the standard gauge fixing. We write 

L = F iny —(1/2€)d-A? + 4,60"c — Emo Z Cc. (12.9.5) 
and define BRS variations 

6A, =0,c, dp =ie,Z}? cy, dh = —ie,Z3!*Wc, 
ôc =0, 6c= —0:A/E. (12.9.6) 

Then X is BRS invariant, aside from an irrelevant total divergence. Since 
the Faddeev—Popov ghost is a free field, it can be omitted without affecting 
any physics (except gravity). 

We may use the general methods of Section 12.2, with the result that the 
theory is renormalizable. Since the ghost is a free field, the renormalization 
factors X and Z are both unity. Hence 

= e.Z2, 71° (12.9.7) 
and the renormalization of e is gauge independent. (If we use minimal 
subtraction we write eg = y?” e.) We now find that e,A = e,A,, and 
that the covariant derivative and gauge transformations simplify: 

D,Y = (0, — iepA WW, (12.9.8) 
y > eey, (12.9.9) 
Using this method we also find that m = Z3 'm?, so that the ghost field has 
finite mass. Then the Lagrangian is 
L = —4Z,F}, + 3m? A? — (1/2¢)0: A? 

+ Zli +e4A — Mo, (12.9.10) 
with the ghost ignored, since it is a free field. Note that there is no 
counterterm for the photon mass. 

Similar methods apply in theories where the gauge groupis a product ofa 
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non-abelian group and one or more U(1) factors. The Weinberg—Salam 


theory of weak and electromagnetic interactions is a simple case — with 
gauge group SU(2) @(1). 


12.9.2 Elementary treatment of abelian theory with photon mass 


The BRS methods are much more sophisticated than necessary for the 
abelian theory, so we will now treat the theory by elementary methods. 
First we write the free photon propagator for the perturbation theory of the 
Lagrangian (12.9.10): 


a ae a o kk 
ae EREE Jay + k, k,/m*) m2 k? — Em? + ie 


i 1 — €)k,k, 
-arrg ~ 90 t val (12.9.11) 
If m=0, then we cannot remove the gauge-fixing term, for then the 
propagator does not exist. (This is the same as taking the limit č > œ.) But if 
m is non-zero, then the propagator exists when the gauge-fixing term 
is removed. However it behaves like ik,k,/k?m7, rather than 1/k?. So the 
theory with m =0 and é = œ has worse divergences than usual and is not 
manifestly renormalizable. Even so, physical quantities are independent of 
č and the theory is renormalizable if é is finite, as we will see. Hence enough 
cancellations are present as č > œ that the theory remains renormalizable if 
we only compute physical quantities. 
Our treatment makes extensive use of the equations of motion: 


y -^ 
Au RA, 
= Z,mA" + Z(O A" — 040: A) + 08- A/E +eoZ3 Wy"WZ, 
— 0, 
AS 
ee Ky oe Owe’ 
AS ao a 


We have not assumed the relation (12.9.7). Taking the divergence of the 
gauge-field equation of motion gives 
0 =(0/é + m’)d-A. (12.9.13) 


Here, we have used the invariance of the theory under the global 
transformations, which are (12.9.4) with constant œw. The Noether current 
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for this invariance is the electromagnetic current: 
jh = —eoZ3?Z Wy". (12.9.14) 


This is conserved, by the electron’s equation of motion. 
The result (12.9.13) that ô: A is a free field is important for three reasons: 


(1) It would otherwise be difficult to interpret the longitudinal part of A, 
which has unphysical properties. 

(2) The ¢-dependent part of the propagator is confined to the k,k, term, 
which only contributes to d-A. Then the €-dependence decouples from 
physics if ð- A is free. 

(3) Similarly the bad ultra-violet behavior when & — œ is decoupled from 
physics. 


To make these statements precise we will examine the Green’s functions. 
We will prove directly: 


(1) Ward identities, 

(2) mg = Z3 'm?, so that no mass counterterm is needed for the photon, 
(3) no counterterms are needed for the ô: A* term, 

(4) eo =epZ; 1°, with e, finite, 

(5) Z, but not Z, depends on é, 

(6) the S-matrix is independent of é. 


We will also compute the exact ¢-dependence of Z, when minimal 
subtraction is used, and we will compute the exact €-dependence of the 
residue of the electron’s propagator pole. 


12.9.3 Ward identities 


Green’s functions of Z 4, Z, and Z; are non-zero, since the derivatives in 
these operators are implicitly taken outside the time-ordering. To obtain 
the Ward identity corresponding to (12.9.13), we use identities like 


. AX 
(0| TY, (x)X|0> lea (12.9.15) 

Now 
(D/E+m’)d-A=6,L£4, —iePL; + ieL yw. (12.9.16) 


Wechoose e) = epZ3 "°, m = Z3 'm*. This gives a Lagrangian of the form 
(12.9.10). There are no counterterms for A? and 0: A’, and the counterterm 
for w Aw is proportional to the counterterm Z, — 1 for Paw. We will prove 
later that this is correct. But for the moment we will choose to have our 
Lagrangian in this form. Our gambit is that if extra counterterms should be 
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- needed, then they will not be available and the Green’s functions will 
diverge. 
From (12.9.16) it follows that 


(C,,/€ + m)<0|Td- A(x) X|0> 
ô 
= izz <0]| TAX/AA (x)|0> 


+ ey (0| TY(x)AX/AY(x)|0> — ep (0| TH(X)AX/AY(x)|0) 


= i$ (0|TX|0), (12.9.17) 


where the gauge variation is computed according to (12.9.4). 
A convenient way to write these results is as follows: 


(1) Let @ be a free scalar boson field of mass &'/2m. 
(2) Let d(field) = gauge variation of ‘field’ with parameter 4, i.e., 


6A, = 0,0, 
ôy = lenoy, 
oy = —iepo. 


(3) Then (12.9.17) is 
(1/€)<0| TA-AX|0> = (0| Th(x)dX|0>. (12.9.18) 
The field ¢ is similar to the Faddeev—Popov ghost except for being a boson 
(this will be important later). In a non-abelian theory the nilpotence of the 
BRS transformation is crucial to proving Ward identities and is proved using 
the anticommutation of c with itself. A field introduced in the same way as ¢ 
has to be the ghost field and must be a fermion. In (12.9.18) we can choose @ 
to be a boson. 


12.9.4 Counterterms proportional to A? and 0: A? 
Let us apply (12.9.17) to the case X = A (y): 


(C/E + m?)-— =<] TA"(x)A,(y)|0> = (12.9.19) 


This equation implies that no counterterms proportional to A? or to 
0: A? are needed. For suppose otherwise. Then consider the lowest order in 
which there is a divergence in the photon propagator. The divergence comes 
from the insertion of a divergent self-energy graph in the free photon 
propagator. The divergence has the form of Fig. 12.9.1, where the cross 
denotes the counterterm to this divergence. Power-counting indicates that 
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mrp 
Fig. 12.9.1. Counterterm to photon’s self-energy. 


the only divergences are proportional to k,k, and g,,. A divergence g,,,k? 
— k,k,is cancelled by the wave-function renormalization Z, for the photon. 
Insertion of the extra counterterms in the propagator gives a divergent 
contribution to the left-hand side of (12.9.19). But the left-hand side is finite. 
So there are in fact no divergences. Hence mẹ = Z3 'm’. It also follows that 
the photon self-energy is transverse. 

Applying (12.9.17)to X = A (y)A,(z)and to A,(w)A ,(y)A,(z) shows that no 
counterterms cubic or quartic in A are needed. 


12.9.5 Relation between e, and Z, 
Apply (12.9.17) to X = W(y)(z): 


(D/E +m?) O| TACO |0> 
= eC0|TWyIM|0>[5-e—y)—Hx-2)]. (129.20) 


We assumed ey = Z; */*eg. If this is not the correct counterterm, then let the 
w—p—A 1PI vertex first diverge at N-loops. Then the left-hand side of (12.9.20) 


diverges at order eg” *1, while there is a counterterm Z, — 1 available to 


make the right-hand side finite. Hence the left-hand side does not diverge. 


12.9.6 Gauge dependence 


The gauge variation of a Green’s function <0|TX|O> is 
i 
265 
p a In Z, 
0g 


536 0|TX|0) = [atx<o|[o-4%1x) — Co|a-47]0)x|0> 


[atx<olry-z,x)o> 
i 

2 

- N,<0|TX|0>ôIn Z,/d¢ 

=5 fatx <o] TE- (0/420) ]52x|0> 
~N,<0|TX|0) an Z,/0. (12.9.21) 


dtx<0| Tô: A(x)(x)dX |0 


Here N, is the number of y fields in X (which equals the number of Wy fields). 
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2 , 
ae TAs A,jO> = —-~+--~— = lowest order 
i A 


Fig. 12.9.2. Examples of (12.9.21). 


The disappearance of the factor 5 in the last equality is due to the identical 
nature of the two ¢ fields. Also, 6? — (0|¢7|0) appears, rather than ¢?, 
because the pure vacuum graphs never occur on the left-hand side. (A better 
derivation uses two different fields @ , and ¢, for each application of the Ward 
identities.) Notice that we cannot derive this equation if ¢ is a fermion, for 
then ¢? = 0. So, in anon-abelian theory, the derivation must be replaced by 
the more complicated one which leads to the weaker (12.4.4). The graphical 
structure of (12.9.21) for a few simple cases is shown in Fig. 12.9.2. 

From (12.9.21)itisimmediate that all Green’s functions of gauge-invariant 
fields are €-independent. 

The only graphs with UV divergences are those with the simple loop 
(Fig. 12.9.3) attached to a w or a w vertex together with those with the 
dln Z,/0€ factor. We therefore find that 


ôlnZ, . | d*k 1 


is UV finite. Hence 


oln Za Em 2 \d/2-2 P 
J = EF < ro- aa)( Jr :) + finite. 
i 
. J 


Fig. 12.9.3. All UV divergent graphs contributing to (12.9.21) contain this loop. 


12.9 Abelian theories: with and without photon mass 329 


If we use minimal subtraction then 


2 
Z,=Z,(E =O)exp al (12.9.23) 


To obtain the €-dependence of the S-matrix, we recall the LSZ formula. It 
tells us to consider the corresponding Green’s function, and pick out the 
poles in its external momenta. Since we use transverse polarization (e-k = 0) 
for photons this picks out graphs with the loop Fig. 12.9.3 and the ô In Z,/0€ 
terms. Then 


p-M p? — m? 2 
ext 2 ext 3 
Fermions photons 


From Fig. 12.9.2 for ¢0|Tw|0> and for ¢0|TA,A,|0>, we see that the 
physical masses and the residue of the photon pole are €-independent, while 
the gauge dependence of z, exactly cancels the gauge dependence of the 
particle pole coefficient of <0| T |0). 

Wecancomputeexplicitly the €-dependence of the residue, z,, of the pole of 
the fermion propagator, if we use minimal subtraction: 


sae ee (12.9.22) 


0 d>4 


A 
~ 1602} 279 Any? j | 


2 2 
z, = z,(č = 0)exp l ES, £ —1+in = Ji (12.9.25) 


Since €) = čZ,, eg = Z3 !e°, and mo = Z; 'm’, the combinations e7é and 
Em? in (12.9.25) are RG invariant. 


12.9.7 Renormalization-group equation 


Using our knowledge of the renormalization of e,, čo, and mg we find the RG 
equation for a Green’s function of N,w’s, N,w’s, and N, A’s to be 


1 


ae + Ney, +3N473)G, 


(12.9.26) 


0 0 0 
O= (He +B- uM + yams S 
with 
B = 5ey3 + (d/2 — 2)e. 


If we use minimal subtraction then the only €-dependent coefficient is y,,and 
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from (12.9.23) we find that 
Y2 = Y2(€ = 0) + če? /8n?. (12.9.27) 


Theresults (12.9.23), (12.9.25), and (12.9.27) were derived by Lautrup(1976) 
and by Collins (1975a). 


12.10 Unitary gauge for massive photon 


The unitary gauge is the limit č > œ. Since the S-matrix and gauge-invariant 
operators are €-independent, this limit exists for them. But for gauge-variant 
operators there are severe ultra-violet divergences. Thus the limits € > œ, 
d—4 do not commute. 

We may take -— œ first, in the regulated theory. The resulting UV 
divergences at d = 4 may be cancelled by extra counterterms beyond those 
that we have already considered. Since the S-matrix is gauge-independent, all 
these counterterms must vanish by the equations of motion. 


13 


Anomalies 


A characteristic feature of relativistic quantum field theories is that 
symmetries of the classical theory are not always present after quantization. 
We do not mean here the spontaneous breaking that is characterized by a 
non-invariant vacuum and by the presence of the Goldstone bosons. Rather 
we mean a situation where there is no conserved current for the symmetry 
despite the absence of any terms in the action that appear to break the 
symmetry. Such breaking of a symmetry is called anomalous. 

If the classical action is invariant, then a naive application of Noether’s 
theorem gives us a conserved current. That is, there is no anomalous 
symmetry breaking. What prevents the argument from being correct is the 
presence of UV divergences. The current is a composite operator, i.e., a 
product of elementary fields at the same point, and to define it, some kind of 
regularization and renormalization is needed. The renormalization may 
invalidate the equations used to prove Noether’s theorem. 

For simplicity, we will consider only global symmetries, as opposed to 
local, or gauge, symmetries. The simplest cases of global symmetries were 
considered in Chapter 9. These could be treated by using an ultra-violet 
regulator that preserved the symmetry. The proof of Noether’s theorem can 
then be made in the cut-off theory. We showed that only symmetric 
counterterms are needed. Consequently the symmetry remains good after the 
cut-off is removed. 

However, not all symmetries can be preserved after regularization. The 
case which we will treat in this chapter is that of chiral symmetries. These are 
transformations that act independently on the left- and right-handed 
components of Dirac fields. These are particularly interesting because 
sometimes the anomalous breaking of chiral symmetries cancels. Indeed 
there is a theorem, first proved by Adler & Bardeen (1969), that if 
anomalous breaking of a chiral symmetry is zero to one-loop order then it is 
zero to all orders. 

Our treatment will use dimensional regularization. Chiral symmetry is 
valid in the physical space-time dimension d = 4, but not when d + 4. The 
anomaly in, say, an equation of current conservation will be an operator with 


332. Anomalies 


in effect a coefficient proportional to d—4, where d is the space-time 
dimension. This would vanish at d = 4, were it not for the existence of ultra- 
violet divergences which allow the anomaly operator to have a pole at d = 4 
so that a non-zero anomaly results at d = 4. We will see explicitly how this 
works. 

There are two key issues. The first is to derive simple forms for the 
anomaly. Of these the most dramatic is the Adler—Bardeen theorem that 
tells us that in some cases there is complete cancellation of the anomaly. The 
second issue is to derive the results in a form that is applicable to the 
physical theory, i.e., after renormalization and removal of the cut-off. Even 
though a particular derivation depends on the choice of a particular 
regularization scheme, the final results must be independent of this choice. 

Nete that to prove existence of an anomaly, it is not sufficient to say that 
the symmetry in question is broken in the regulated theory. The breaking 
may go away after removal of the cut-off. For example, if one uses a lattice 
cut-off, then Poincaré invariance is lost. However, one must prove that 
Poincaré invariance is restored in the renormalized continuum limit, if the 
theory is to agree with real world phenomena. 

Aside from the case of chiral transformations, there are a number of other 
important situations where there are anomalies. One of the simplest is that of 
dilatations. These are scale transformations on space-time: x“— Ax". A 
classical Lagrangian is scale-invariant if it contains no dimensional 
parameters, like a mass scale. But to cut-off ultra-violet divergences we 
necessarily introducea mass scale. Thesymmetry is necessarily broken inthe 
regulated theory and the question arises of whether the symmetry remains 
broken after the theory is renormalized and the cut-off is removed. This 
answer is, in general, yes, if the theory has interactions. The reason is that 
there is, in fact, a mass scale hidden in the renormalized theory, as we saw 
when we discussed dimensional transmutation and the renormalization 
group, in Chapter 7. Detailed treatments can be found in the literature (see 
Collins (1976), Brown (1980) and references therein). The simple form for the 
Ward identity is known as the Callan—Symanzik equation (Callan (1970) and 
Symanzik (1970b)). The information contained in the Callan—Symanzik 
equation is in fact also contained in the renormalization-group equation 
that we studied in Chapter 7. 

Other situations which we willnot treat include the following: chiral gauge 
theories (see Costa et al. (1977), Bandelloni et al. (1980), Piguet & Rouet 
(1981)), conformal transformations (Sarkar (1974)) and supersymmetries 
(Piguet & Rouet (1981), Clark, Piguet & Sibold (1979, 1980), and Piguet & 
Sibold (1982a, b, c)). 
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13.1 Chiral transformations 


We will consider QCD with two flavors of quark: up and down. (We could 
have more flavors, but no essentially new ideas would be needed.) The 
Lagrangian is (2.11.7). Ifthe quarks were all massless then the classical theory 
is invariant under the following transformations of the quark fields: 

y exp {iLi — yog + wht”) + H(1 + yo + ogy. (13.1.1) 
Here the matrices t° are the generators of the isospin group acting on the 
flavor indices. The transformations (13.1.1) form a group that we will call 
U(1), © U(1), ® SU(2), ® SU(2),. The symmetry of the QCD Lagrangian 
under these transformationsis broken by mass terms for the quarks. Since the 
masses of the u and d quarks are small, the chiral symmetries are only weakly 
broken. 

These symmetries and their breaking were understood well before the 
advent of QCD - see, for example, Treiman, Gross & Jackiw (1972). 
Treatments of chiral symmetries in the light of QCD can be found in 
Marciano & Pagels (1978) and in Llewellyn-Smith (1980). Thus it 1s 
unnecessary to go into details here. What we will emphasize is how the 
potential for anomalies arises. 

Since these transformations involve y,, they are in some sense coupled to 
the spin structure of the theory. Since spin is related to the symmetries of 
space-time, we can expect trouble when the theory is regulated, for 
imposition of an ultra-violet cut-off must alter the space-time structure. 

Notice that there is a U(1)@SU(2) subgroup not involving y,; these 
transformations have œw, = Wp. For them the treatment of Chapter 9 is 
correct. The corresponding Noether currents are 


jh =Zpy"h, 
je = Zyty. (13.1.2) 
The conserved charge derived from j“ is the conserved quark number, while 
the transformations generated by jë are just ordinary isospin trans- 
formations. 
For the other generators of chiral transformations, let us define axial 
currents 
js = Zr" ys]v, 
jas = ZWaLy", ys ]°y. (13.1.3) 
In four dimensions y"y, = — y,y",so that we could have written yy, in place 
of the commutator in (13.1.3). However, because we will use dimensional 
regularization, we must use the form (13.1.3) in order to ensure that the 
currents are hermitian. 
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To define the theory we must regulate its ultra-violet divergences, and for 
this we will use dimensional regularization. We will see that the regulated 
theory is not invariant under the transformations generated by the axial 
currents j§ and j%,. This allows the possibility ofan anomaly. When thecut-off 
is removed we will find that we can arrange for the non-singlet currents j*, to 
be without anomaly. Although we will not demonstrate it, it is true that the 
singlet current necessarily has an anomalous divergence. 

Our remarks above were addressed to the case that all the quarks are 
massless. But in the real world, there are quark mass termsin the Lagrangian, 
so we now generalize our discussion. The vector singlet current j“ is the 
current for quark number — i.e., 1/3 of baryon number — so this remains an 
exact symmetry. The SU(2) symmetry given by the vector currents is broken 
by quark mass differences: 

ô jt = —ip[t*, My, (13.1.4) 
where M is the quark mass matrix. Since the masses of the u and d quarks are 
small we have an approximate isospin symmetry of strong interactions. By 
the theory given in Chapter 9, the currents j# and j“, as defined by (13.1.2), are 
finite, since the breaking is from mass terms (Symanzik (1970a)). 

The axial symmetries are broken by the anomaly as well as by the quark 
mass terms: 

ô, ji = — 2iby,My + anomaly, 

ô jis = — iy, {t*, M}w. (13.1.5) 
The u- and d-quark masses are light enough to give us an approximate 
SU(2) ®@ SU(2)symmetry. The axial part appears to be spontaneously broken 
as well as explicitly broken. The abnormally light mass of the pion is usually 


taken to mean that it would be a Goldstone boson if m,=m,=0. See 
Marciano & Pagels (1978) and Gasser & Leutwyler (1982) for more details. 


13.2 Definition of Ys 


In order to formulate consistently the dimensional regularization of theories 
with fermions we had to define an infinite set of matrices y“, (u = 0,1,2,...). As 
we saw in Chapter 4, they satisfied the algebra 

{yt y} =2g""1, tri =4. (13.2.1) 
Our task is now to find a generalization of the matrix which at d = 41s called 
ys. Its four dimensional definition is 


ye=iy yyy’: (13.2.2) 
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and it satisfies the anticommutation relation 
(ys. ys =O (d=4). (13.2.3) 


It would be natural to assume that this relation (13.2.3) can be maintained 
for arbitrary values of d. Unfortunately an inconsistency arises, as we will 
now show, when we wish to compute the trace of y, with a product of the 
ordinary y”’s. The ultimate result will be that stated in Section 4.6, where we 
used the definition (13.2.2) of y, for all values of d. Then y, has mixed 
commutation and anticommutation relations, (4.6.3). 

We now demonstrate the inconsistency in trace calculations, starting with 


tr ys: 
dtr (y5) = tr(ysy"7,) 
= tr(y,y°y,) 
= —tr(ysy,y") 
— dtr (ys). (13.2.4) 


In the first and last lines we used 


Yul” =i") = gil =d. 
In the second line we used cyclicity ofa trace, and in the third line we assumed 
(13.2.3). From (13.2.4) we see that tr y, = 0 except at d = 0. Now when we 
apply dimensional regularization we wish to obtain a result that is a 
meromorphic function of d. Hence we must have try, =0 for all d. 
Similarly 
dtr YS Yuy = E Y 5Y yPyPaY 
= E VYY uY 
= = Ysy Y yY 
— 2ga tE V5Y,Ya + 293 tE Ys Yp Ya — Atl Ys Yy7y 
— 2 trys {Ym Yy] + (4 —Dtrysy yy 
= (4 — d) tr Y 5Y uyv (13.2.5) 
Here we used the result tr y; = 0. Hence we find that (d — 2) tr ysy,y, 1s zero. 
Now, the same technique can be used to prove that 


(4 — d) tr (Y5Yx Yau?) = 9. (13.2.6) 


At d = 4 this equation permits the usual non-zero trace of y, with four other 
Dirac matrices. However, if the trace is to be meromorphic in d, equation 
(13.2.6) shows that it must be zero at d = 4, and we can therefore not obtain 
normal physicsat d = 4. Thisis the inconsistency referred to at the start of this 
section. 
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We are therefore forced to drop one of the hypotheses that led to (13.2.6). 
Candidate hypotheses for removal include: 


(1) the anticommutation relation (13.2.3) (t Hooft & Veltman (1972a)), 
(2) the use of dimensional regularization for fermion loops (Bardeen (1972), 
and Chanowitz, Furman & Hinchliffe (1979)). 


These last authors have shown how to calculate with a totally 
anticommuting y,. Bardeen chooses to use a regulator other than 
dimensional regularization for all fermion loops. On the other hand, 
Chanowitz et al. regulate fermion loops with an even number of y,’s 
dimensionally. Their procedure is useful for low-order graphs, since the 
Ward identities are preserved for graphs without fermion loops. However, we 
then lose the use of dimensional regularization as a complete regulator; the 
theorems that we derived in Section 6.6 and in Chapter 12 no longer apply. 
The details of higher-order calculations by this method have not been spelled 
out. 

Therefore let us follow °t Hooft & Veltman (1972a) and Breitenlohner & 
Maison (1977a) and change the anticommutation relation. In fact, we may 
use the definition (13.2.2) for all values of d, just as stated in Section 4.6. Our 
definition is, of course, not completely Lorentz covariant, since the first four 
dimensions are picked out as special. But this is not an overwhelming 
objection, for our actual physics is confined to these dimensions. An 
important advantage of the definition is that it gives a concrete construction 
of y,. We are therefore guaranteed consistency. 

One notational inconvenience arises. We have a set of matrices y“ for 
p=0,1,2,.... Usually we only refer explicitly to the first four; the rest are 
referred to collectively. But there is y“ with u = 5. It is not the same as y, 
defined by (13.2.2). However, we bow to standard usage and use y, to denote 
the matrix defined in (13.2.2). Confusion should be rare. 

The y, so defined has mixed commutation and anticommutation relations 
that follow from (13.2.2) and the properties of y“. These were stated in (4.6.3). 
The derivation of (13.2.4)-(13.2.6) now fails, because y, does not anticom- 
mute with all of the y”’s. We can derive the correct relations easily. Since the 
trace of an odd number of y”s is zero, we have 


tr(y“ys)=0, tr (ysy"y*y") = 0, etc. (13.2.7) 


We may read off the trace of y, withan even number of y”’s from (4.5.13) and 
its relatives with more than four y-matrices. Thus we have 


tr (y5) = of 


(13.2.8) 
tr(ysy"y") = 0. 
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Finally 
tr (7 5YxVaYu?v) = Ale, uv (13.2.9) 


where é,.,,,, 1s given by (4.6.2). 

The relations (4.6.3) mean that thechiral transformations (13.1.1)no longer 
generate a symmetry if dis not equal to 4. To discuss the resulting problems, 
the following notation is useful (Breitenlohner & Maison (1977a)): 


» _ f9, if u and v are 4 or larger, 
"v )0, otherwise; 
pH = gry, 


(13.2.10) 


Here V“isany vector. Then ĝ „is a projector onto the unphysical dimensions. 
Thus, for example, 


At a gt = oe =d—4, 
n = 9" Guy = 9" Gp i (13.2.11) 


{F ys} = 2 Viiv ys} = 2, = 27; ĵ. 


We may also define projections onto the four physical dimensions: 
BPN for if u and v are less than 4, 
0 otherwise; (13.2.12) 
V" = g" V.. 
The following results are elementary, but will prove useful: 
27, = (6 — dy”, 
YP Y, = (4 — dy”, 
VY Ye =y 
ew a 


(13.2.13) 


Let us define w*=(w? + œ8)/2 and w =(— w? + w2)/2. Then the 
variation of Y under the chiral transformations (13.1.1) is 


5L = iỌ(20?y, + w*[M, t] + 03y: {M, CW — Wy Do + oey. 


(13.2.14) 
Hence the divergences of the axial currents are 
A, ia = — ibys {t*, M}y + yyy 

= — ibys {t*, My + Pys{e*, Biw/2, (13.2.15) 

ð, ji = — 2ipysMy + by py. (13.2.16) 


The second term in each equation can potentially give an anomaly when we 
let d— 4. 
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13.3 Properties of axial currents 


There are a number of somewhat different situations in which axial currents 
appear. The original papers on chiral anomalies primarily addressed 
anomalies in Ward identities of the chiral currents of strong interactions. 
More general cases have since been worked out, with corresponding 
generalizations of the Adler—Bardeen theorem. In this section we will list the 
various cases and state what is known. 


13.3.1 Non-anomalous currents 


The following properties apply, for example, to the non-singlet axial currents 
in QCD: 


(1) Ifthereisno anomaly to one-loop order (as for the current j*,), then there 
is no anomaly to all orders. The Ward identities of the current with 
elementary fields have no anomalies. 

(2) Under the samecondition as in(1)thereis no anomaly inthe two-current 
Ward identities. | 

(3) Under the same condition there is an anomaly in a three-current Ward 
identity (like the one for 0, (0| Tj*5(x)j#(y) 72(z)|O> ). However, the only 
non-zero term in the anomaly is the one-loop contribution. 


The theorem that the complete anomaly in these cases is determined by the 
one-loop value is due to Adler & Bardeen (1969). 

The lack of anomalies in the Ward identities of one current with 
elementary fields is essential if the currents are to generate the correct 
transformation lawfor the fields. These transformationsimply commutation 
relations for the currents. Since these commutators are also given by the 
Ward identities with two currents, the two-current identities must be 
anomaly-free. 

No such consistency requirement applies to the three-current Ward 
identity. The value of its anomaly is related to the decay rate for n° > 2y, and 
the lack of higher-order corrections enables a successful prediction to be 
made easily. (See Adler (1970); for reviews from the point of view of QCD, see 
Marciano & Pagels (1978) and Llewellyn-Smith (1980).) 


13.3.2 Anomalous currents 
The singlet current ją has an anomalous divergence. It has the form 
„J5 = C(g)G4,G%"” + mass terms, (13.3.1) 
where Gi, 1s the gluon field strength tensor and G, is its dual: 


GS Gs (13.3.2) 
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The Adler—Bardeen theorem asserts that the coefficient of C(g) is equal to its 
one-loop value 


C(9) = Nag? (322°). 


However, the operators j, and GG need renormalization, and there is not an 
obvious natural renormalization condition. So the valueof Cissusceptibleto 
change by redefinition of the renormalization prescription. We will not treat 
this case here. 


13.3.3 Chiral gauge theories 


Theories like the Weinberg—Salam theory of weak interactions have a 
gauged chiral symmetry. It is essential that there be no anomaly, for 
otherwise the theory is not renormalizable and loses other important 
properties (Gross & Jackiw (1972) and Korthals Altes & Perrottet (1972)). A 
generalization of the Adler—Bardeen theorem is that there is no anomaly to 
any order of perturbation theory if there is none to one-loop order. Proofs 
have been given by Becchi, Rouet & Stora (1976), and by Costa et al. (1977). 


13.3.4 Supersymmetric theories 


Supersymmetric theories have potential anomalies similar to the chiral 
anomalies. A completely general treatment has not yet been given, but many 
particular cases have been treated — see Piguet & Rouet (1981), Piguet & 
Sibold (198 2a, b,c), Clark, Piguet & Sibold (1979, 1980), and Jones & Leveille 
(1982). 


13.4 Ward identity for bare axial current 
Without use of the equations of motion the divergence of the non-singlet 
current j*, is 
ô jis =i Z,Wy stip — Mp + he 
T iZ,W{ Mo, thy T aZ wet D, ysy 
= Dt, + Di, + Di 


anom’ 


(13.4.1) 


The first term we will call the equation of motion term. When inserted in a 
Green’s function with elementary fields it gives 


O| T Demn] [4 [] yo 1ye] 
= d(x — y)<O| TT TAT ¥TT¥19 > yo+- rey 
+ 2 (x — z) <0]| TY TAT [VT] O zeppa (13.4.2) 
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The variations of the fields are just their chiral transformations multiplied by 
i. (We set we = — wf = w% in (13.1.1) to obtain these transformations.) This 
equation (13.4.2) has the expected right-hand side for the Ward identity in the 
absence of anomalies. 

The mass term on the right of (13.4.1) is expected ; it is the non-anomalous 
breaking. 

The anomaly term is D? om. If we insert it in a Green’s function with no 
divergences of any kind, then we may set d = 4. The result must be zero 
because D%_,,, vanishes if only the first four y-matrices are used (for then 

{D, y,} = 0). But if the Green’s function has an overall divergence, or ifit has a 
divergent subgraph that contains the vertex for Dlhom, then the result may be 
non-zero at d = 4, as we will see. The full Ward identity reads 

<0 Tj! [|A] [y[ [Y |0> = right-hand side of (13.4.2) + mass term 


+ <0| TDi nom] TAT WT T¥]0>. (13.4.3) 
Recall that in the case ofasymmetry such as the isospin SU(2) of QCD that 
has no anomaly, we used its Ward identity to prove the current finite. The 
only possible counterterm for the current is proportional to itself, so 
finiteness of the divergence of the current, 0-j,implies finiteness of the current 
itself. The Ward identities imply that the divergence of the current is finite. 
However, for the axial currents the extra term in (13.4.3) prevents this 
argument from being made. 


13.4.1 Renormalization of operators in Ward identities 


Our aim will be to construct a finite current jkas that at d = 4 satisfies a non- 
anomalous Ward identity: 


KANZ Tjas(x)| JA] [y] [Y |0> = right-hand side of (13.4.2). 
a (13.4.4) 


The first step is to observe that the only counterterm to j#, is itself. No other 
operators have the correct dimension and quantum numbers. So we can 
define a minimally subtracted operator 


[jis] = Zs jis 
=3Z; Z4“, ys]. (13.4.5) 
Throughout this section we will use square brackets to indicate minimal 


subtraction. So the renormalization factor Z, has the form 


Z, = 1 + poles at d = 4. 
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To obtain the operator jgas, we will later show that we have to invoke a 
further finite renormalization. 

We will show that the Ward identity (13.4.4) 1s not correct ifj,,, is replaced 
by the minimally subtracted [ j,,]. Rather we must make a further finite 
renormalization to obtain a Ward identity without anomalies. Thus we have 

Jas = 25L Jas] 
= 3252 5Z,W[y", ys ]ty, (13.4.6) 
where z.(g) is a finite factor. 

The anomaly operator D% om is a dimension four scalar quantity, so there 
are several operators with which it can mix. It is proportional to the -tensor 
times a fourth rank tensor, which we will call A**“’. Since ¢,,,,, appears 
nowhere in the Lagrangian, the tensor A**“” is invariant under all Lorentz 
transformations. Given these restrictions, a complete list of operators that 
mix with Df om is 


Ôa Jas Dems W{Mo, t* yew. 


No operator involving only ghost and gluon fields can be constructed such 
that it mixes with D¢,,,,- The linearity in ¢,,,, implies the presence of four 
factors of vector objects (derivatives or A-fields). Therefore the coefficient of 
the operator is independent of mass, by our usual results. Gauge invariance of 
D$ om allows a restricted set of gauge variant counterterms (see Chapter 12) 
none of which have low enough dimension to appear. Since j, is even under 
charge conjugation, so is D%_,,,. Therefore the only allowed counterterm 
proportional to quark mass has a flavor factor {M,t*} rather than the 
commutator [ M, t°]; this gives us the operator D4 that appears in (13.4.1). 

We can therefore write the minimally subtracted operator corresponding 
to [Dé om] as 


nom 


[D ] z ZaD noin + Zas0 Jas + ZamPM. (13.4.7) 


The equation of motion operator is finite by itself — see (13.4.2) — so we have 
[Din] = Dim- (13.4.8) 


Note that the definition of Dm includes some counterterms, but these are 
manufactured from the wave-function, mass, and coupling renormalizations 
in the Lagrangian. 

We also need the renormalization of the mass operator 


[D2,] = Zsu D4. (13.4.9) 


It is somewhat unobvious that the renormalization is multiplicative. The 
easiest method is to examine the y-matrix structure of self-energy graphs with 
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an insertion of Wt*y,w or wy. These operators can then be shown to be 
multiplicatively renormalizable with a common factor. 

By use of (13.4.6)-(13.4.9) we can express the equation (13.4.1), for the 
divergence of the bare axial current, in terms of renormalized operators to 
find 


[Dim] = Z5 “(1 — ZasZa *)0,L Jas] 
— [Di |Zsu(l + ZamZa )— Za *[Dinom]- (13.4.10) 
The renormalized operators on the right are all linearly independent, so the 


only way a linear combination of them can equal the finite left-hand sideis for 
the coefficients to be finite. Since we use minimal subtraction this implies: 


Z,=1, 
Zm=Z;ųu—l, (13.4.11) 
Zs =1-Z,. 


We therefore have the renormalized Ward identity: 

0 = 
aa 6O| T[j¢s || [A] [y] [y |0> = r.h.s. of (13.4.2) 

+ (O| T([Diq] + [Dénom] I TAT [yT IY 19>, 

(13.4.12) 


which apparently still has an anomaly. Before showing how the anomaly in 
fact disappears, let us examine some low-order graphs. 


13.5 One-loop calculations 


Thetree approximation for the two-point Green’s function of [D4 om Jis given 


by Fig. 13.5.1. To save algebra we will set quark masses to zero. The graph’s 
vaiue is 


ip’ Aa = aP 
ETET (13.5.1) 


If we let the external momenta be physical, i.e., in the first four dimensions, 
then this vanishes. The vertex for D? om has the property we define as 
evanescence: it vanishes when the cut-offisremoved and wego tothe physical 
renormalized theory. We will formulate a precise definition of evanescence 
later, when we have understood the subtleties associated with inserting the 


vertex inside loops. 


a 
D anom 


p p 
Fig. 13.5.1. Tree approximation for two-point Green’s function of [Dom]. 


a 
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(a) (b) (c) 
Fig. 13.5.2. Graphs up to order g? for two-point Green’s function of j#,. 


Next let us examine the graphs for the two-point function of thecurrent j*,,, 
as given in Fig. 13.5.2. Graph (a) has the value 


ip’ _, P 


- tt (13.5.2) 

D zY Y5 p? 

Taking the eee is the same as multiplying by i(p’ — p)”. The result is 
ip 


P- p)iyst" -z p? 


alp 
+ Ys 
y p)it 7 


-ya 


mnene 
= — ysti p/p? —i(p'/p°)yst + (13.5.1). (13.5.3) 


When we set p= p’ = 0, to get the four-dimensional result, we obtain the 
lowest-order case of the chiral Ward identity. 

The next order graph is Fig. 13.5.2(b). Its value, with the external 
propagators aren 1S 


, =u v 
7 k?(p’ + k)?(p + k)? 
This is evidently divergent. It is easy to calculate the pole at d= 4: 


OC ole 
30,2 P Ia ggr’ Very. 


a 
jira -97| 


Ta = 


pole (T,,) = 2 


Jys. (13.5.5) 


Here we have twice used the result that 
Va) Vsy” = Va) Vs)” + Vy sy" 
= ys = Vel PY s 
= (6 — d)y"y.. (13.5.6) 
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There is a counterterm graph implicit in the definition j#;= 
Zwy"yst*W, with the quark wave-function renormalization given by 
g° Cy 


aa! (13.5.7) 


Z,=1 


The resulting counterterm graph Fig. 13.5.2(c) therefore cancels the UV 
divergence of graph (b), leaving a finite result. No additional renormalization 
is needed: 


Z, =1 + O0(g*). (13.5.8) 

Let us next take the divergence of (13.5.5) plus its counterterm, by 
multiplying byi(p’ — p)„ Itis left as an exercisefor the reader to verify that the 
Ward identity (13.4.3) holds at this order. What we will do is examine the 
graphs of order g? for the Green’s function of D? These are listed in 


anom’ 


Fig. 13.5.3. Note that the definition includes a covariant derivative: 
Z% {P, Ys} y = 3Y Z 49, sftp — iGoZW{ Ao, ysjty 
= ZPY — 2igo ZW Aoy sty. (13.5.9) 
The ~Ay.y term gives rise to the graphs (b) and (c). 
p p 

k 

(a) (b) (c) (d) 
- Fig. 13.5.3. Graphs of order g? for two-point Green’s function of [D%, on]. 
Graph (a) equals 


O Cpt o aafaa AB + OE + B+ Wyld + Wy” 
ea ee 


x {13 — d/2)D”? -3y [p — y) — px] [b'(L — 2y) + pl — 2x)] 
x ya — x) — By ]y” 
— 31-2 — d/2)D*?~?y, [UPA — y) — Bx) P57" 

+ 2y,3*y5(P(1 — x) — p’y) 

+ ydp — 2y) + BL — 2x))ysy"Jy"}, 


(13.5.10) 
where 


D = — p*x(1 — x) — p° y(1 — y) + 2p-p'xy. (13.5.11) 


If it were not that there is an ultra-violet divergence given by the 
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T (2 — d/2), we could let p and p’ be four-dimensional, and then set d = 4 to 


obtain zero. (Note that y," = 9,9” = d — 4.) The pole prevents this argument 
from being made. First of all, notice that the pole is 


j e l 
poles) = ~ gett poled znad + Bra r} 
o y ’ 
s des (13.5.12) 
Tt — 


The manipulations on the Dirac matrices are easy to do incorrectly, so let us 
be careful. We need the following result 


Val Vel = Vel V57" PO Y7" 
= VP V5 — VP Ys + 275 
= (10 — d)p*y.. (13.5.13) 
In the first line we split y* into a four-dimensional piece y* and a 
(d — 4)-dimensional piece y“. Then in the second line we used the 
commutators or anticommutators of y“ and y* with j“ and y,. 
The graphs of Fig. 13.5.3(b) and (c) may be evaluated similarly. The sum of 
the pole terms for all three graphs is 
= ig’Cy aP a B)Y 


13.5.14 
877 4-—d ’ ( ) 


pole (T34 + I3 + T3.) = 
whichis cancelled by thecounterterm Fig. 13.5.3(d). Thisisin agreement with 
our general result (13.4.11). 

We are now ready to compute the value at d = 4 of the sum of the graphs of 
Fig. 13.5.3. Considerable simplification occurs. Since p and p’ are now zero, 
the term in (13.5.10) that multiplies I (3 — d/2) vanishes. Similarly the last 
term multiplying ['(2 — d/2) gives zero. The remaining two terms have a 


factor %9, =d — 4, which cancels the pole to leave a finite result: 
ape 4 
ig“C 
Wag tg epg = TE t"(p = P)Y 5: (13.5.15) 


Similarly T}, and T}, give 


ig?Cy 

(13, + ls) psa gece =- t"(p' — Pys. (13.5.16) 
Effectively Fig. 13.5.3 sums to g*C,/(4n*) times the vertex for 

O-j,5. It is easy to understand why the result should be of this form. 

Without ft loop integration, the vertex for D{,,,, vanishes when 

k = p =p’ = 0. When we include the integration over the components of k, we 


can a a non-zero value for the graphs even if p = p’ = 0. However, the 
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evanescence property of the basic vertex implies that it has effectively a factor 
d—4. We only get a finite result by multiplying by an ultra-violet 
divergence — so the effect at d =4 is of a local operator. 

A general theory of evanescent operators can be worked out. The results 
simply generalize what we have learnt from examples: 


(1) Wedefine an evanescent vertex as one that is finite and that vanishes ina 
tree graph when weset d = 4and when all momenta and polarizations are 
four-dimensional. 

(2) A Green’s function or a graph or an operator is evanescent if it is finite at 
d = 4and if it vanishes when its external momenta and polarizations are 
four-dimensional. | 

(3) Consider a graph containing an evanescent vertex. If the graph is 
completely finite then it is evanescent. (“Completely finite’ means that the 
graph and all its subgraphs have negative degree of divergence.) 

(4) A renormalized operator [E] whose basic vertex is evanescent has the 
following expansion: 


[E] = } Cey[V] + evanescent operators. (13.5.17) 
4 


The sum is over operators V whose basic vertices are non-evanescent. 
The only operators that are needed are the ones that according to the 
usual power-counting and symmetry requirements will mix with E 
under renormalization. The general proof is left as an exercise to the 
reader. 
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13.6.1 Reduction of anomaly 


The only operator that can appear on the right-hand side of (13.5.17) for the 
case E = Dom is 0:[j,5]. The restrictions are that it be pseudoscalar, 
isovector, gauge invariant and of dimension at most four. (If we had non-zero 
quark masses, then the operator [D{, | could also appear.) So we have 


[ Doom | = C(g)d-[ j4; ] + evanescent, (13.6.1) 
which, when substituted into the renormalized identity (13.4.12), gives 
0 z 
za OTA - CLs [A] Iy] 1410> 
=r.h.s. of (13.4.2) + evanescent. (13.6.2) 
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So we should define the renormalized current 
Jas = (1 — C)[ 45] 
=(= jhe, (13.6.3) 
which is (13.4.6) with z, = 1 — C. For the physical four-dimensional theory 
this implies that j2,, has Ward identities with no anomaly, viz. (13.4.4), as we 
wished to prove. | 
From our calculations of Fig. 13.5.3 we see that 


C = g?C,/(4n2) + O(9°). (13.6.4) 


Our proof has been long and involves a general theory of evanescent 
operators summarized in Section 13.5. The basic idea, however, is simple. 
The only way an anomaly can appear in the physical theory is when a 
divergence cancels an effective factor of d — 4 for the evanescence of an 
anomaly. The anomaly in the four-dimensional theory is a local operator, 
and the only possible operators are those which power-counting would allow 
as counterterms to ĝ'j. 

In the case of our isovector current, the only such operator is 0-jitself.Soa 
finite renormalization (13.6.3) serves to eliminate the anomaly at d = 4. 


13.6.2 Renormalized current has no anomalous dimension 


Let us apply the renormalization-group operator pd/du to the Ward 
identity (13.4.4). For the right-hand side we get 


d . l ; 
u right-hand side = right-hand side {2 anomalous dimensions of fields}. 
m 


To get the same result for the left-hand side, the current jk, must have zero 
anomalous dimension (when d = 4): 
d 
Hy Jras = 0. (13.6.5) 
u 
This is a sensible result: the current jġ„s 1s a physical object, and it should not 
depend on how we parametrize the theory by a renormalized coupling. 


Useful consequences follow, for the minimally subtracted current does 
have an anomalous dimension: 


ud[ jas]/du Se re YsLJas_|> (13.6.6) 
where 


y5(g) = udln (Z5)/dy 
= [(2 — d/2)g + B(g) ]éln(Z,)/dg. (13.6.7) 
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Now, the coefficient C is a function of g; it is dimensionless and even in the 
presence of masses cannot depend on them, just like the renormalization 
factor Z,. We also define it to have no cut-off dependence, since it is a factor 
between renormalized operators at d = 4. 

We therefore have 


0 Ud jpgs/dp 
= — B(6C/0g)[ jas] — (1 — Oysl ius], 


so that 
Bol — C)/ðg = y,(1 — C). (13.6.8) 


It follows that 
g 
1 —C=exp f | agy), (13.6.9) 
(0) 


where we used as a boundary condition the fact that C has a perturbation 
expansion starting at order g?. In order that the integral in (13.6.9) be 
convergent, the order g? term in y, must vanish. 

So from the definition (13.6.7) it must be that Z, has no order g* term; this 
we know by explicit calculation. Moreover, we know from (13.6.4) the one- 
loop value of C, so that 


Ys = — A,g*C,/(2n7) + O(g’), (13.6.10) 
where the one-loop term in Bis — A,g°. Hence (13.6.10) gives us a prediction 


of the leading divergence in Z,: 


A,Crg* 
Z,=1~—+~—"— + 0(g°). 13.6.11 


The reader is invited to check this by Feynman graph calculations. 
We may use the techniques of Chapter 7 to sum the divergences. We find 
the full renormalization factor of jp,, to be 


JRas —1 
o (= C)Z; 


g 1 l 
exp} [ag Ys(g Na azar | 
B Fe ain T 
= | 7 ALa- Da + BO) 
= 1 + (d — 4)0 [ln (g? /(4 — d))] 
ei (13.6.12) 


Evidently the Noether current is finite in the complete theory. 
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13.7 Three-current Ward identity; the triangle anomaly 


13.7.1 General form of anomaly 


We consider the Green’s function 
D4 (p,p) = | déxdtyel?**”"0| Tja). (13.7.1) 


Only connected graphs contribute; Lorentz invariance forces the vacuum 
expectation value of a current to be zero. The currents are all renormalized 
currents, so all subdivergences are cancelled by counterterms, and the only 
possible infinity in (13.7.1) is an overall divergence. In fact there is no overall 
divergence, as we will now show. 


A k S 
Fig. 13.7.1. Lowest-order graph for (13.7.1). 
Individual graphs for (13.7.1) have a linear divergence, as can be seen from, 


say, Fig. 13.7.1. Any divergence must be linear in external momenta and 
proportional to the e-tensor. The only possibility is 


etm a(d)p, + b(d)p,.]. (13.7.2) 
There is also the constraint of conservation of the vector current. This is 
expressed by constructing a Ward identity in the dimensionally regularized 
theory. 
Consider 


p,D*"” = —i [atxdtyelrs rer nel Tips) HOO RO)|O>. (13.7.3) 


By use of the result 
iô Z pyr = Z,ptiP — Mob — Zp( — iP — Mo'y 
and the equations of motion, we find 


p,D™” = | dixie +P) 0| Tjè s (OZ JEE, ]y"w(x)|0> 


+ | dyi” <0] TZ ap [re PIP 7s O)|). (13.74) 


In these equations we assumed that the currents j# and j? are conserved. Each 
of the terms in (13.7.4) is a Green’s function of a vector and a pseudovector 
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current. Parity invariance forces them to be zero, so 


p,D**’ =0. (13.7.5a) 
Similarly 
p.D**” = 0. (13.7.5b) 
The counterterm (13.7.2) must therefore give zero when multiplied by p, or 
p,. This forces the whole counterterm to be zero ; the Green’s function (13.7.1) 
is finite as it stands. 
In the regulated theory the axial current is not conserved, so we cannot 
prove the Ward identity (p + p’),D*“” = Oby the same manipulations. Indeed 
we have 


(p+ p) D” = commutator terms + [atxatyeres +p“y) y 


x €0| TEO j). (13.7.6) 
Here E is the evanescent operator in (13.6.2): 


E = [Dinom] _ Co-[ jas | 
= [Dinom] = c(l == Cy OT kus (13.7.7) 


The commutator terms in (13.7.6) vanish, as in the Ward identity for the 
vector currents. Hence, finiteness of the left-hand side implies that the Green’s 
function of E with j¥ andj‘ isfinite. Even though graphs for it are quadratically 
divergent, the divergences cancel. 

Now, E is an evanescent operator. This means that its Green’s functions 
with elementary fields vanish in the four-dimensional theory. The general 
theory of evanescent operators, which we summarized at the end of 
Section 13.5, then tells us that the only way that the right-hand side of (13.7.6) 
will fail to vanish is for E to be part ofa graph or subgraph with overall degree 
of divergence at least zero. Now, the definition of E has ensured that these 
subgraphs are all evanescent. Hence we are left with the complete graphs. So 
we have (at d = 4) 


(p + p’),D“”” = A(9)E,vagP"P ” Eade 
= FA Ey vaglP at: p')*(p 7 DP) ee (13.7.8) 
The tensor structure is the only one possible. The coefficient A is 


dimensionless at d = 4, so it can only be a function of g. Note that the right- 
hand side of (13.7.8) obeys vector current conservation, so that 


p,(p + p), D = 0, 
p(p + p’),D**" = 0, 
as should be. 
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13.7.2 One-loop value 


~ The lowest-order value of A is easily computed from the graphs of Fig. 13.7.1. 
te pepe 3 (ak LB =P + 2B + Bkk- PI] 
uvapP p 4 (27)? k?(p $ k)?(p’ _ k)? 
+ charge conjugate. (13.7.9) 


The factor 3 is the number of quark colors. To evaluate this, notice that 
Ka À) — 
tr (7577) 0, om l (13.7.10) 
tr (57" p yey’) = 4e”. 
Since £, iny is restricted to the first four dimensions, it follows that the trace of 
y, with four Dirac matrices is zero if one of the matrices is a ĵ: 


tr (y <7" y*y"y’) = 0. (13.7.11) 


Let us commute the — p’+ Ķ in (13.7.9) to the left, and use (13.7.11) 
whenever possible. We will also set p= p’ = 0. The result is 


tr [Ø P + 2Rys(b + Dyk- p +H] 
= tr [2y (p + Kyk — ky] + 0 
tr [2y (p + Ky“kp' y] +0 
— tr [2y (p + Hky“p' y] +0 
— tr [2Ryspky“p'y”] + 0 
— tr [2kkypy"p’y’] +0. (13.7.12) 
The terms indicated by ‘0’ vanish by use of (13.7.11). The charge conjugate 


term gives an equal contribution. 
We now have 


dik tr [Ry sby"P'y"] 
(27)° k*(p' — k)? (p + k)? 
This is now only a logarithmically divergent integral. After use of Feynman 
parameters the standard result (4.4.14) gives 


AE svap P P? = 3 (13.7.13) 


w Aa S 
AE yyapP"P” = Fpa | ax | dy5I(2 — d/2) x 
(0) (0) 


try, y"ysPy p y] 


ee LEELA E Ed 
p= pe A 


3 aB 
= — Ba? ouvasP p . (13.7.14) 


The evanescence of the vertex has effectively given a factor of d — 4 which 
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cancels the UV pole to leave a finite result 
A= — 3/(8n7) + O(g”). (13.7.15) 


This result for the anomaly in the axial Ward identity for (13.7.1) was first 
found by Adler (1969), and Bell & Jackiw (1969). 


13.7.3 Higher orders 


There are, infact, (Adler & Bardeen (1969))no higher-order corrections to the 
anomaly for (13.7.1). We will follow the proof due to Zee (1972). The basic idea 
is simple. Each of the currents in (13.7.1) is RG invariant, and there is no 
overall counterterm. Therefore this Green’s function is invariant when we 
make an RG transformation. The anomaly must therefore be invariant 
also. But the anomaly coefficient A(g) depends on the coupling g and on no 
other parameter of the theory. We can change g arbitrarily by changing the 
renormalization mass u. Hence A is independent of g. 

This proof may easily be written out. Renormalization-group invariance 
of DŻ” is the equation 


abc 
Hence 
u< Aloe e D Dy =(p + pu Dit =0. 
du abc al B A du abc 


Since ud/du = p/u + Bo/dg, this gives 


BodA/eg = 0. 
Then A is independent of g, so it equals its lowest-order value: 
A = — 3/(827) exactly. (13.7.16) 


This is a very striking result. The proof we have given is very simple, but the 
reader should not suppose it is not a deep result. The whole power of the 
renormalization apparatus is needed for its derivation. We first had to show 
that there isno anomaly in the divergence of the axial current. Then we had to 
show that there was no counterterm needed to make D?#’ finite. These results 
involved considerable cancelation of UV infinities. Sincethe anomaly in ¢-j,. 
disappears when the UV cut-off is removed, it can affect a Ward identity only 
by being enhanced by a UV infinity which has not made its appearance 
earlier. 

Thus the anomaly is associated with a UV pole implicit in the Feynman 
graphs. It is precisely for this reason that it must have the dependence on the 
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parameters of the theory and on the external momenta that is characteristic 
of a renormalization counterterm. In particular, it is polynomial in the 
momenta and masses of the degree determined by U V power-counting. Once 
thisisclear,the most general possible form of the anomaly is (13.7.8). The final 
step to show that A is independent of g is trivial. 

An important phenomenological consequence of the anomaly is a 
calculation of the decay rate for n? > 2y (see Marciano & Pagels (1978) and 
Llewellyn-Smith (1980)). The amplitude is proportional to the number of 
quark colors, so the decay rate is proportional to the square of this number. 
The measured rate in fact agrees with the standard theory that there are 
three colors. 


14 


Deep-inelastic scattering 


In this chapter we willshow how the operator-product expansion can be used 
to compute the cross-section for deep-inelastic scattering. Since the 
calculation is fairly straightforward, this process is one of the classic tests of 
quantum chromodynamics (QCD). 

The process is the scattering of a lepton of momentum /4 on a hadron of 
momentum p“, with the only observed particle in the final state being the 
lepton: 


l+ N>! + anything. (14.0.1) 


In practice one uses a beam of electrons, muons, or neutrinos, and the 
hadrons in the target are nucleons. There are a number of cases for which 
there are experimental data, the cases being distinguished by the types of 
lepton involved: 


e + N >e + anything, 

u + N > u + anything, 

v+ N >v + anything, (14.0.2) 
v+ N-(e or u) + anything. 


The basic reason for measuring the cross-section for these processes is to 
study the fundamental constituents (quarks and gluons) of the hadron. 
Suppose we have a scattering of a lepton on a small-sized constituent, and 
that the momentum transfer is large, so that the scattering happens over a 
small time-scale. The weak and electromagnetic interactions have a small 
coupling, so to a good approximation the lepton does not interact again. 
Moreover the interactions that turn the final state, involving the struck quark, 
into hadrons happen on a much longer time-scale. So these interactions do 
not interfere with the basic Born graph. Hence, we should beable to calculate 
the cross-section for the process (14.0.1) rather simply in terms of the cross- 
section for lepton—quark scattering (Fig. 14.0.1). The approximation in 
which final-state interactions of the hadrons are ignored is called the impulse 
approximation. We can use it because we choose to sum the cross-section 
over all hadronic final states. 
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X 


Fig. 14.0.1. Parton model for deep-inelastic scattering. 


A more mathematical formulation of the same idea is the parton model, 
explained in Section 14.2. There are, however, many weak points in the 
intuitive discussion just given, and we must remedy them. In the remainder of 
this chapter we will compute the cross-section from the theory of strong 
interactions (QCD). The parton model will in fact give a qualitatively correct 
approximation to the real cross-section. Our intuitive argument shows why 
deep-inelastic scattering is simple enough to permit calculations. 

Our treatment in this chapter is based on material to be found, among 
other places, in Gross (1976) and Treiman, Jackiw & Gross (1972). 


14.1 Kinematics, etc. 


We will compute the cross-section to lowest order in weak and electro- 
magnetic interactions. Then the amplitudes contributing to the process 
(14.0.1) have the form of Fig. 14.1.1 where a boson is exchanged between the 
leptonand the hadron. The boson can bea photon,a Wor a Z. Athighenough 
energy, it is also necessary to include Higgs boson exchange. Higher-order 
weak and electromagnetic corrections do not need to be included, except for 
soft photon effects. We will ignore the soft photon corrections here, and will 
concentrate on understanding the strong-interaction corrections. 


l 
[' 
q 


X 
p 


Fig. 14.1.1. Amplitude for process contributing to deep-inelastic scattering. 


We first review the kinematics of the process. The two independent 
Lorentz invariants of the hadron system are chosen to be 
Q? OT Q"4,> 
v=p'ą, (14.1.1) 
where q“ is the momentum transfer from the leptons. The mass of the final 
hadron state X is then 


m? =(p +q}? =m + 2v — Q?. (14.1.2) 
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In the laboratory frame, where the initial hadron is at rest, we can express Q? 
and vin terms of the initialand final lepton energies E and E’, and of the lepton 
scattering angle 0: 


Q? = 2EE'(1 — cos 0). 
v=m,(E — E’). (14.1.3) 
We have neglected lepton masses with respect to E and E’. The following 
inequalities hold 
v>Q7/2>0. (14.1.4) 


The region we will investigate is where both Q? and m? get large in a fixed 
ratio. We define the Bjorken scaling variable x = Q7/(2v). Then we let Q? get 
large with x fixed. This is called the Bjorken limit or the deep-inelastic region. 
In this region the missing mass my is large: 


mz = Q7(1/x — 1), (14.1.5) 
(where weneglect m} compared with Q”). In order that m? be positive we must 
have 0 < x < 1. The Bjorken limit applies only if x is not at its endpoints. An 


equivalent variable to x that is sometimes used is œw = 1/x. 
The cross-section is given by 


do ] do 


, 


d3p’ 2nE' dE'dcos0 


ames” X16 


= IME RN > DMPO p — p — 4") 


== Jw l KHU * T)AV 
-(%) mrg lD DAL, QWaslPs D- (14.1.6) 


Here /'*"' and jẹ? are the currents to which the exchanged vector boson 


couples, gy is its coupling, and D*"(q) is its propagator. The lepton tensor L, , 
is easily = in the tree approximation for 


= CU REOS ST FE" (OLD. (14.1.7) 


The hadron tensor is cca to 


W,(D,q) = zI < p|i,(0)' | X >< X |7,(0)| p >(22)* 5 (pk — p" — q") 


R -| dtye't?< p| j OVO |p), (14.1.8) 
T 


where the normalization is the standard convention. 

Deep-inelastic scattering is the region where Q gets large with x fixed. This 
is not the short-distance limit we treated in Chapter 10. There we took all 
components of q” to infinity in a fixed ratio, so that Q > œ with Q/p-q fixed, 
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i.e., xis proportional to Q. Thisis notin the physical region0O < x < 1for deep- 
inelastic scattering. Luckily we can use a dispersion relation (Christ, 
Hasslacher & Mueller (1972)) to relate the deep-inelastic limit of W to the 
short-distance limit of the time-ordered matrix element 


Thy = | d*ye*”< p| Tj OVJ 00>. (14.1.9) 


We will perform this analysis in Section 14.3. 

Let us now decompose W,, into tensors with scalar coefficients. We will 
assume that the hadron is unpolarized, i.e., that we average over its spin 
states. Then the most general form of W,, is 


Wav = Wi = Guy + 44/9") + W(P, — 4,¥/4°)(P, — 9,V/9") 
— i WE uvapb*g (2M?) + Wi9,4,/M? 
+ W;(p,4, + q,P,)/2M°) — iW6(P „dy — 4„P,)/(2M°). (14.1.10) 
The scalar coefficients W(Q?,v) are called the structure functions. The 
normalizations are such that they are dimensionless. A number of properties 


follow from symmetries and basic quantum mechanics (Treiman, Jackiw, & 
Gross (1972)): 


(1) Each W, is real. 
(2) Time reversal invariance of strong interactions implies that W, = 0. 


In the case of a purely electromagnetic process, the current Pa is 


conserved, so that q"W,, =0. Hence W, = W, =0. Moreover the elec- 
tromagnetic current is a pure vector, and strong interactions are parity 
invariant. So W, =0. 

In the case of neutrino scattering, the currents are only conserved if quark 
masses are zero. In that case the only non-zero structure functions are W,, W, 
and W,. It turns out that when the masses are non-zero the contributions of 
W, and W, to the cross-section are suppressed by a factor of order mm,/Q’, 
where m, and m, are lepton and quark masses. We will discuss this further in 
Section 14.8. 

When we compute the structure functions in the Bjorken limit, Q? — œ, x 
fixed, we will find that they behave as a certain power of Q? times logarithms. 
The power is (Q7)° for W,, 1/0? for W, and W,,and 1/Q* for W,, and W,. Soit 
is convenient to anticipate these results and define scaling structure functions 
F(x, Q?) which depend only logarithmically on Q?. The standard de- 
finitions are: 


F (x, Q?) = W,(Q?, v = Q*/(2x)), 
F(x, Q?)= vW,/M2, for i=2 and 3, (14.1.11) 
F(x, Q?)=v?W,/M*, for i=4 and 5. 
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14.2 Parton model 


Before we discuss the true theoretical predictions for deep-inelastic 
scattering, let usdiscuss the parton model (Bjorken & Paschos(1969)). Thisis 
the simplest model for the process, and contains the essence of the correct 
physics. One considers the initial state hadron in the overall center-of-mass 
for the whole scattering. Now time dilation slows down processes in the 
hadron so that they typically occur on a rather long time-scale T of order 
Q/m?,. Then the scattering happens on a much shorter time-scale T ~ 1/Q. 
This implies that the hadron may be regarded as an assembly of non- 
interacting point-likeconstituents. These are what were originally (Feynman 
(1972)) called partons. (We now identify them as quarks and gluons.) Since 
the hadron in this frame is ultra-relativistic, we must regard the partons as 
massless and aseach moving parallel tothe hadron witha certain fraction z of 
its momentum. 

The structure functions of the hadron are obtained by computing the 
partons’ structure functions in tree approximation and by then summing 
over all partons weighted by their number density. 

Let f „n (z)dz be the number of partons of type ain hadron N with fraction z 
toz + dz ofits momentum. Then, for example, the electromagnetic structure 
functions are: 

2F I(x) = x FSG) 
av 3L Syn) + fa(x)] +4 2 [Sfan(x) + fyun(x)] 
q=d,sS 


+ heavy quark terms. (14.2.1) 


The relation F, = 2xF is characteristic of the spin-4 of the quarks (Callan 
& Gross (1969)). Notice that F, and F, are independent of Q?. This is the 
property known as scaling. Experimentally, F, and F, obey this property 
approximately. 

It is common in the literature to use the terms ‘structure function’ and 
‘quark distribution’ interchangeably, because of the parton model relation 
between them (14.2.1), which is also approximately true in QCD — see 
Section 14.7. However, it isimportant to distinguish the two terms. Structure 
functions are coefficients of certain tensors in a current correlation function. 
They can be defined for other processes, e.g., the Drell-Yan process (Lam & 
Tung (1978)). On the other hand the quark distribution functions are exactly 
what their name implies: probability distributions of quarks in a hadron. 

The parton model is correct in a super-renormalizable theory (Drell, Levy 
and Yan (1969, 1970a, b,c)). However, in a renormalizable theory like QCD, 
there are processes inside a hadron that happen significantly on all time scales 
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down to zero. Then the basic assumption of the parton model does not hold. 
However, the fact that it is short times that cause the problems allows the 
operator-product expansion to come into play to solve the problem. There 
are two approaches, essentially equivalent for deep-inelastic scattering: 


(1) Useadispersion relation toshow that moments of thestructurefunctions 
(i.e., 


F, (0°) = [axe ! F, (x, Q7)) (14.2.2) 


can be directly computed by the Wilson expansion. This approach was 

initiated by Christ, Hasslacher & Mueller (1972), and it is the one we will 

use. 

(2) One can generalize the derivation of the operator-product expansion 
(Amati, Petronzio & Veneziano (1978), Ellis et al. (1979), Libby & 
Sterman (1978), Stirling (1978)). For deep-inelastic scattering, the result 
is equivalent to the first method without the taking of moments. 


14.3 Dispersion relations and moments 


Consider the time-ordered Green’s function T,,, defined by (14.1.9). It can be 
expanded in scalar structurefunctions T,, T,, T3, T,, Ts, Te, just like W,,. We 
will only need T,, T,, and T,. The operator-product expansion derived in 
Chapter 10can be applied to T, when Q? and v get large with Q”/v’ fixed. As 
we have seen, this is not the scaling region, for we have x—> œ instead of x 
fixed. However, we will relate T, to W,,, byadispersion relation. Then we will 
see that information on W,, in the physical region can be obtained from the 
operator-product expansion for T,,,. 

If Q? is fixed and positive then each T, is analytic in the v-plane. There are 
cuts going out to infinity from the thresholds v = + Q7/2. See Fig. 14.3.1. 
(This is a standard property. It can be proved by expressing 


Fig. 14.3.1. Analyticity of W,, and contour to derive (14.3.1). 
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<N|Tj,(y)j,(0)| N > in terms of W,, and noting that W,, is zero if |v|< Q?/2.) 
The discontinuities across the cut are: 


T,,(p,q) |= = 4rW, (p,q), (if v > 0). 


| (14.3.1) 
T(P, qQ)|\ i = 4n W, (P,Q), (if v <0), 
where W,, is W,, with j replaced by its hermitian conjugate j' : 
RA 1 
W = |d*ye'*”< plj OAO |p >. (14.3.2) 
4n 


For the electromagnetic or neutral-current processes the current is 
hermitian, so that W,, = W,,. But if W,, is for charged-current neutrino 
scattering then W, „gi ves the : structure functions for antineutrino scattering. 


By Cauchy’s theorem we have 
d / 
TAQ?) =>> | =— TAQ? v’), (14.3.3) 


cY —y 


where Cis anycontour a, v, as Shown in Fig. 14.3.1. We will beable to 
compute the T,’s in the short-distance limit v/Q* —> 0. So suppose we expand 
T;(Q?, v) in a power series in 1/x = 2v/Q?: 


T,= a Ti ,(Q7)x~", 


vT,/M? = QT Q7)x"" (i=2 or 3). (14.3.4) 
(We expand vT,/M? (ifiis 2 or oi analogy with(14.1.11).) Then from (14.3.3) 


T; (Q°) 


ie 


v’ Q? n 
y (5 T (Q? v')(v'/M*)%, (14.3.5) 


where a;=0 if i=1 and a;=1 if i=2 or 3. If n is large enough to give 
convergenceat|v’| = oo, then wecan deform thecontourand pick up only the 
contribution from the ae eee of T, across the cuts: 


00 2 aj 
r= |" (5) wav sz) 
Q?/2 v 
2 ae dy FO? \" Pe Ge ome i 
+] ane-a) 436 


Finally we write the right-hand side in terms of the scaling functions 


F(x, Q?): 
T; ,(Q7) aoa ai | x's" LF (x’, Q?) F ( ~~ 1)" F(x’, Q?)] 
0 
=— 2i[ F; ,(Q7) + (— 1)"**F(Q7)]. (14.3.7) 
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This is the dispersion relation referred to earlier. It relates the power series 
expansion of T, about v = 0 to the moments of the F’s, which are defined by 


1 
F; y(Q7) = | dxx%~! F(x, Q7). (14.3.8) 
(0) 

The relation (14.3.7) only applies if the integral is convergent. For small 
enough values of nit diverges. If v“ W, behaves like v? as v > œ, then we have 
convergence only if nis greater than p. Now the limit v > œ at fixed Q? isa 
Regge limit (elastic scattering ofa virtual boson offa hadron at energy mZ).So 
there is a standard expectation (Treiman, Jackiw & Gross (1972)) that p = 1 
fori = 1 or 3and p = Ofori = 2. Thisisequivalent to F ,, F,/x, F, all behaving 
like 1/xas x > 0. Inthe parton model this would correspond toa 1/x behavior 
for quark distributions, and is roughly what is measured experimentally. 

Our theoretical predictions will give all the terms in the series expansions of 
the T,’s, e.g. (14.3.4). Those coefficients for which (14.3.7) does not apply will 
not have any direct implications for deep-inelastic cross-sections. 


14.4 Expansion for scalar current 


To explain without a slew of indices the method for computing moments of 
structure functions, let us first work out the case where j, is replaced by a 
hermitian scalar operator j. For example, j might be ZZ, g,q;, appropriate to 
the coupling of a scalar boson to a particular flavor i of quark. We have a 
single scalar structure function: 


F(x, Q?) = W(v, Q?) = (1/42) jd*ye*”<pli(y)iO)|p>, (14.4.1) 


while the time-ordered function is: 
T(v, Q?) = [etvercn Tjj |p). (14.4.2) 


(Notice that we choose j to be a renormalized operator. As is the case for the 
$? operator for a scalar field, the renormalization factor for gq is the same as 
the mass renormalization factor, so [ gq] = ZZ,,4q = Zm01o» Where qo isthe 
bare quark field.) 

The dispersion-relation argument applied to the series 


T(Q2/(2x), Q?) = ` T,(Q)x~" (14.4.3) 
gives 


T,(Q) = — 2i[F,(Q) + (— 1)"F,(Q)]. (14.4.4) 
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Since the current is hermitian, we have F(x, Q) = F(x, Q), so that 


—4iF (Q), if nis even, 
T.(Q) = ‘ 
(2) i 0, if n is odd. 


Regge theory suggests T ~ 1/x as x > 0, so this equation is valid only if n is 
bigger than 1. 

We now apply the operator-product expansion. The results of Chapter 10 
apply in the limit Q? > œ with v?/Q? fixed, i.e., with x = constant times Q. 
Now 


(14.4.5) 


T,x ~" =Q7"[T,(2v/Q)"], (14.4.6) 


so that it would appear that all but then = 0 term are non-leading bya power 
of Q? and that we only have a reliable prediction for n = 0. But the relation 
(14.4.5) is not expected to hold unless n> 1. 


a. 


Fig. 14.4.1. T(v, Q). 


To remedy this problem, we must find an object for which x`" T contains 
the leading-power behavior as Q > œ. Thisisdone by making a partial wave 
decomposition in the t-channel. That is, we treat T(v, Q) (Fig. 14.4.1) as we 
would treat a scattering amplitude, and decompose it in terms of angular 
momenta: 


<p] Ti(y)j(0)|0) = 2 ¢plVj|p>M,(y-q/@2). (1447) 
=0 


Here < p| V,|p>isthereduced matrix element of some operator V, ,, ofspin J, 
and the M , are appropriate polynomials in y-q/q’. (The operator V, is not 
necessarily local.) 

Now we perform an operator-product expansion of Tj(y)j(0), keeping the 
leading-power behavior for each spin: 


CPTI] p) ~ — 21> (<p O? | |p>C74(y7)y" . yes 
J,a 


(14.4.8) 


Here the operator (77 „is a local operator of spin J. The label ‘a’ denotes 
different operators of the same spin. Only the symmetric part of the operator 
is relevant, and in order that it be of definite spin, it must be traceless. Since 
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the hadron is unpolarized the matrix element has the form 


(p|O77 a [P> =<p|O"*|p>(p,, --P,, — traces), (14.4.9) 


where <p|(7*|p» is the reduced matrix element — a scalar quantity. The 
normalizations in (14.4.8) have been adjusted for later convenience. 

The leading power of y as y—0 is obtained by using operators of the 
minimum possible dimension. In QCD these are 


Jq = 1-J- š $ è s 
Ou.. ny 72 4Y iD,- 1D, q, symmetrized minus traces, 


of , =—2° 7G, iD,,...iD 


v 7 : 
My... My G, ,„ symmetrized minus traces. 


(14.4.10) 


In accordance with the results of Section 12.8 we have kept only gauge- 
invariant operators. (We use D, to denote the covariant derivative, and q to 
denote the field of a quark of flavor q. Sums over color indices are implicit in 
(14.4.10). Only hermitian operators are needed.) 

Our usual power-countingargumentsimply that the behavior of the scalar 
coefficient C/9(y*) in (14.4.8) is 


(y?)~4™© times logarithms, 


Bjc 


with 
dim (C) = 2dim (j) — dim (03$, )+ J 
= 2dim(j) — dim[< p|O""|p>/<p|p>]. (14.4.11) 
The dimension minus the spin of an operator is evidently the important 
quantity here; it iscalled the twist of the operator. The leading twist 1s two, for 
the operators of (14.4.10), and for them C(y) ~ y~* modulo logarithms. 
Fourier transformation of (14.4.8) now gives 


T(v, Q?) = — 2i È <p|0 "|p C7*(Q)x 7 + non-leading powers of Q, 
J,a 


(14.4.12) 
where we define 


C’*(Q) = 010/00?) faye» co) (14.4.13) 


Perturbative calculations of T (v, Q?) will give us Č“ in (14.4.13). We can then 
obtain moments of F(x, Q) from the dispersion relation: 


2F (Q)= >) <p|0 |p) Č [1 + O(1/Q)], (14.4.14) 


if J is greater than one. 
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14.5 Calculation of Wilson coefficients 


There are two parts to the calculations. The first part is to do low-order 
calculations of the Wilson coefficients 


C’*(g, Q/u), 


where we have now explicitly indicated the dependence on all parameters. 
Higher-order corrections have logarithms of Q/u, so the second part of the 
calculation is to compute the anomalous dimensions and then to do a 
renormalization-group transformation to set u to be of order Q. Thus we 
write 


2F,(Q)= }, <PlO"|P > wC G(Q),1)M.,0(9(Q),2/u). (14.5.1) 


The subscript (4) on the matrix element denotes renormalization with unit of 
mass p. The matrix M is obtained by solving the renormalization group 
equation for C, and the Č” (g(Q), 1) is well approximated by its lowest-order 
term in perturbation theory. Measurements of deep-inelastic scattering at 
one value of Q are enough to give < p|0"*| p>, and then (14.5.1) predicts the 
moments of the structure functions at other values of Q. 


14.5.1 Lowest-order Wilson coefficients 


The Wilson coefficients are independent of the target, so we may calculate 
them with the hadron state replaced by a quark state. In tree approximation 
we have the expansion sketched in Fig. 14.5.1. The scalar ‘current’ j is the 
renormalized operator ZZ,,.q,q; = [4;q; |,for a particular quark flavor i. Since 
the Wilson coefficients are independent of mass, we set quark masses to zero. 
Then from the graphs of Fig. 14.5.1 we find 


pa itl + 9] ite- A] 
2 (p+q) 2 (p—4q)’ 
1/x? 


SA eee 


(14.5.2) 


The factor 5 comes from averaging over the spin of the quark. For a quark of 
any other flavor than i, or for a gluon, we have T= 0 to this order. 


p+q P—q 
} \ + D- Ş xc” Or 
P Jaa 


Fig. 14.5.1. Wilson expansion of <0|Tjj|0> to lowest order. 
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Tocompute the Wilson coefficients we also need the matrix elements of the 
operators defined by (14.4.10). In a quark state with flavor i we have 


CpilOn.y, [PED = bôi tr (DY p, Pus --- Pa, ) 
= Rôi Pu Py,» (14.5.3) 


in tree approximation, with the label i’ denoting a quark flavor ora gluon. For 
a gluon state and gluon operator 


(Pg|On us |P9I> = 2Py, Puy (14.5.4) 
All other matrix elements are zero. Hence the non-zero reduced matrix 
elements are all equal to 2. 
Comparison of (14.5.2)-(14.5.4) with the operator-product expansion 
(14.4.12) shows that 


Či =1 + 0(g?), if J21 and is even, 


(14.5.5) 
Č} = 0(g?), if i’ +i, or if i’ is a gluon. 
14.5.2 Anomalous dimensions 
The anomalous dimension of the operator ZZ,,4;4; ÍS Ym: 
d _ E 
My l mia) = Ymlg)ZZ mgA, (14.5.6) 
and we let y7, (g) be the anomalous dimension matrix of the 0%”s: 
d 
tae = > Yaa (90 E (14.5.7) 


(Operators of different spin do not mix.) As shown in Section 10.5in a simpler 
case, the Wilson expansion then implies a renormalization-group equation 
for the Wilson coefficients: 


ð 0 
È + ps, |e" OM = —Yregl + 7m C% (14.5.8) 


This equation would be trivial to solve were it not that it isa matrix equation. 

The lowest-order counterterms for the operators are generated by the 
graphs of Fig. 14.5.2. It is evident that the different operators mix. To solve 
the RG equation we must diagonalize the anomalous dimension matrix at 
order g’. The first step is to recall that the counterterms are independent of 
quark masses. So the renormalizations respect the SU(n,,) symmetry of the 
flavor space. Therefore let us now choose a new basis for the twist-2 
operators. 
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AAA 
AAA A &s 
AAAA 


019 


Fig. 14.5.2. Lowest-order divergences of the operators in the Wilson expansion of 
<0/Tjj|0>. 


There is a multiplet of non-singlet operators: 

OFNS4—I-TGy iD, iD, Ay, (14.5.9) 
where the As are the n" x n matrices that generate the flavor symmetry. 
There are two singlet operators: 

019, defined by (14.4.10), 


OS SO (14.5.10) 


flav,i 
The renormalizations are 
[0] Z, 1 Z, 2 0 O 
[0] |=|Z,; Za 0 JL Os |, (14.5.11) 
pos] 0 0 Z ys Oo. 


[o.J=> VA Ua 
B 


from which the anomalous dimension matrix 


Yı %2 0 
Vap = |721 Y2 0 (44.5.12) 


0 0 Ys 
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is obtained by 
0 
> VapZpy = Bs Za (14.5.13) 
b ôg 


A calculation of the divergences of the graphs of Fig. 14.5.2 gives (Gross 
(1976)) 


z j J 2 
E oe E 
22 = Ns al JJ +D +455] 


2 J 
, 9g 12 12 1 2 
=2_| f= ee te I 
711 | -D UrDU a lr tz 
_  g’ty (J?+J+2) 
4n? J(J +1)(J +2)’ 


1. g (I? +5+2) 
M2 x II- ` 


(14.5.14) 


14.5.3 Solution of RG equation — non-singlet 


The Wilson coefficients of the non-singlet operators evolve very simply: 


d 
Pdp Cys(g(u), Q/u)=(— INS T ICNS: (14.5.15) 


An approximate solution can be found by taking the one-loop approxi- 
mation for the anomalous dimensions. This gives 


In (Q/A) [Ns ~ Ym J241) 
In (u/A) | 
where y§} , and y? denote the coefficients of g?/4n? in yys(g) and y,,(g), and 
— A, isthe coefficient of g* /(4x”) in B(g). We may replace C’’"5(g(Q), 1) by its 
value in tree approximation. The accuracy of (14.5.16) may be systematically 
improved by taking more terms in the perturbation expansions of B, yys, Ym 
and C. 

The singlet coefficients may be obtained by diagonalizing the 2 x 2 matrix 
of anomalous dimensions. Then there are two linear combinations of singlet 
coefficients that have simple behavior like (14.5.16). 


Chslg(u), Q/u) = Cxs(g(Q), of (14.5.16) 


14.6 OPE for vector and axial currents 


Wewillnow apply theoperator-product expansion to the structure functions 
of T for a weak or electromagnetic current. The argument is a simple 
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generalization of the treatment in Sections 14.4 and 14.5 of deep-inelastic 
scattering with a scalar current. We have 


A —2i [atye E oloi, lP > 
0? oO? 
Hi , Hyp — a Jaj,,2 
x fy ++) ( Jugy? zy JC! (y ) 
ð? ð? ð? ð? 
— ys yi My a H2 = Cia 2 
y y Ç dy? TG dy? 57) 2 (y ) 


, 0 
—1y"?. Teg eC) (14.6.1) 


The derivatives acting on the C}”s are arranged to be in combinations with 
zero divergence, to correspond to the condition q"7T,,=0. Fourier 
transformation as at (14.4.13), with suitable factors of Q7, gives 


Ty ~ — 2i}, <p|O**|p> x 
J,a 


x {(= Gav + 9u4/97)x~ Č (Q) 
+ (1/v)(p, — 4,%/47)(P, — 44¥/47)x?~7C3(Q) 
— 18 vag P” gP (1/2v)x 7 C4(Q)}. (14.6.2) 
The set of leading twist operators is the same as in Section 14.4, and we have 


arranged normalizations so that the C,’s are dimensionless in leading twist. 
Hence the moments of the structure functions satisfy 


F, (Q)+(— 1) F, (0) = X C2(Q)< p|O"*|p) + correction, 


a 


Fz y-(Q)+(— 1) F,,5-(Q) = } C2°(Q)< p|O""|p> + correction, 


F, (Q)+(—1)’*'!F3 (Q) = } C3(Q)< p|O”*|p> + correction. 


a 


(14.6.3) 


These equations are valid for J > 1, and the corrections are of order 1/Q? 
times logarithms. 


14.6.1 Wilson coefficients — electromagnetic case 


To compute the lowest-order Wilson coefficient we consider deep-inelastic 
scattering on a quark target. The graphs are the same as in the case ofa scalar 
current, Fig. 14.5.1, except that the current operator is now the elec- 
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tromagnetic current. We find 


(lowest order) __ 1,2 i(p T d) i(p = q) 
Tay 263 f tr [ør TETE | + a rel 


1/x? 
= aed ae Juv + Ud! Tay 


1 2/x 
v1/x? -1 


+ (P, — 4,¥/q7)(p, — qyv/q7) l (14.6.4) 


Expanding about 1/x = 0 gives 


(0.9) 


T,= —2ie; }, (1/x)"*? + O(97), 


n=0 
vT,/M? = — 4ie? Y (1/x)?"*! + O(g?). (14.6.5) 
n=0 


The Wilson coefficients are therefore 
Clq = e?/2 + O(g’) 
C31 = e; + 0(9°) 
C, =0, 
C39 = 0 + O(g?). (14.6.6) 


The relation Č, = 2Č corresponds to the Callan—Gross relation F, = 2xF 
in the parton model. Since the renormalization-group equation is the same 
for both Ci? and CJ?, the Callan—Gross relation is true in QCD with 
corrections of O(g(Q)7). These corrections are from the Wilson coefficient, 
and have been calculated. See Buras (1981) for an up-to-date list of references. 

The renormalization-group equations are the same as in the scalar case, 
except that y, is replaced by zero since the anomalous dimension of a 
conserved current is zero. 


! if J is even and >2, 


14.7 Parton interpretation of Wilson expansion 


The use of moments in comparing theory and experiment is not very 
convenient, since cross-sections are needed outside the range in which they 
are measured. A more convenient form can be derived in which an expansion 
is obtained for the structure functions themselves. We will just summarize the 
results. More details can again be found in Buras (1981). 

It is sufficient to examine the case of a scalar current for which the 
expansion is 


ld 
F(x,Q)= } | = San WC Az/X; Ql 9H). (14.7.1) 
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The sum is over all species of parton, i.e., flavors of quark and antiquark, and 
gluon. The generalized Wilson coefficient C, is effectively astructurefunction 


for deep-inelastic scattering of a parton state, while f,,(z, 4) is a parton 
distribution. Lowest-order calculations reproduce the parton-model result, 
so that 


C, = Cz= e, ô(z/x — 1)/2, 
C,=0. (14.7.2) 


The renormalization group equation for f has the form 


d ! d 
May ajn(X; 4) = 2 | — Prlz x, 9) Í sın (Z, H). (14.7.3) 


This is called the Altarelli—Parisi (1977) equation — it was first derived by 
these authors in leading logarithmic approximation from an heuristic 
argument. Later derivations (Collins & Soper (1982a), and Curci, Furmansk1 
& Petronzio (1980)) are more complete. 

Integro-differential equations like (14.7.3) are not particularly easy to 
work with. One mathematical simplification that can be made is to take 
moments, with the result that the operator-product expansion of 
Sections 14.5 and 14.6 is recovered. This will enable us to see that the two 
methods are essentially equivalent. However, the mathematically more 
complicated method using convolutions, as in (14.7.3), gives more physical 
insight, can be extended to other processes (Buras (1981)), and can be used 
without knowing structure functions at small x. 

To see the equivalence of the two methods, let us define the following 
moments: 


o (u) = | e, ain(25 H), 
cP = | d(x/z)(x/z)"7 *C,(z, x; Q/u, g(u)), 
(0) 


1 
Ves =| d(x/z)(x/z)"~ * salz, x; g(H)). (14.7.4) 
0 
Then (14.7.1) implies 


F,(Q) = X S ana) C™(g, Q/y). (14.7.5) 


This expansion has the same form as (14.4.14). In fact, moments of the f’s are 
the same as the matrix elements of the twist-2 operators: 


<N|O”|N> = | dx x7 ~ HE fan) + Sanl) ], (14.7.6) 
0 
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where there is a contribution from each type of parton and antiparton. This 
equation can be proved (Collins & Soper (1982a), and Curci, Furmanski & 
Petronzio (1980)) provided only that the same renormalization prescription 
(e.g., minimal subtraction) is used for both the operators and for the parton 
distributions. 


14.8 W, and W, 


So far we have ignored F, and F.. They are zero if the current j“ in W, is 
conserved. But the weak-interaction currents are not conserved if quark 
masses are non-zero: 


ODU — y5)2Y) = UL, 4°] — y5{m, 2°} 
= iD*/2. (14.8.1) 


Here mis the quark mass matrix. It follows that W, and W, are non-zero in 
neutrino scattering. However, we should regard the ratio m/Q as setting the 
scale for their effect on the cross-section (Jaffe & Llewellyn-Smith (1973) and 
Llewellyn-Smith (1972)). They therefore give a small contribution to the 
deep-inelastic cross-section. Since W, and W, are therefore non-leading in 
the Bjorken limit, they are difficult to compute directly. 

A convenient technique to compute W, and W, is to consider 


q"W „ =(q?>W, + p-qWs/2)q,/M? + q°p,W5/2M°. (14.8.2) 


We have the operator formula 


Way = ese""<pLio20LD». (14.8.3) 


so wecancompute W, and W, by making an operator-product expansion for 
j,(y)D(0). Since there is an explicit factor of quark mass min the expression for 
D, this operator behaves as a dimension 3 rather than adimension 4 operator. 
This suppresses the Wilson coefficients for W, and W, by a power of Q°. 
Since, in(14.1.10), the tensors multiplying W, and W, haveaq, or q,inthem,a 
similar suppression occurs because the lepton masses are much less than Q, as 
is seen by contracting q, or q, with the lepton tensor (14.1.7). The result is that 
W, and W, make a negligible contribution to the cross-section at large Q. 
A detailed treatment of W, and W, within QCD can be made but has not 
yet been published. It generalizes the results of Jaffe & Llewellyn-Smith 
(1973), who worked within the parton model. 
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